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T HE National Security Agency, established by 
presidential directive, is the national authority 
for all United States communications security 
activities. Its headquarters building is located 
between Washington, D. C. and Baltimore, 
Maryland. The Agency conducts its own recruiting 
and employment programs to hire promising in¬ 
dividuals from all sections of the country. 

NS A offers interesting and useful careers in many 
of the intellectual disciplines. People trained in 
languages and linguistics find the Agency a con¬ 
genial place in which to work and in which to 
increase their knowledge and use their skills. Those 
trained in certain modern languages and area 
studies may use their skills as basic tools for re¬ 
search in a number of analytical fields. A high 
degree of flexibility and ingenuity is required be¬ 
cause of the varied nature of the work. Systematic 
accumulation of information, data examination, and 
the preparation of special reports are important 
products of this research. The ability to read in a 
wide variety of subjects and to translate into good 
English is important. Fluency in speaking is not 
essential, but knowledge of idiomatic and colloquial 
speech is desirable. Those with an exceptional 
facility in languages are often assigned immediately 
to linguistic duties or are selected for training in 
an entirely new language. Scientific linguists have 
an opportunity to engage in a variety of practical 
applications of linguistics in many complex assign¬ 
ments. 

If you wish to learn more about employment 
opportunities at NS A write to: Chief, College 
Relations Branch, National Security Agency, Fort 
George G. Meade, Maryland 20755, Attn.: M321. 
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THE NS A 

TECHNICAL JOURNAL 


SPECIAL LINGUISTICS ISSUE II 


This special issue of the NS A Technical Journal offers a sampling 
of the scholarly output of this Agency’s linguists. As the National 
center for our country’s communications security activities, the Na¬ 
tional Security Agency has to deal with a wide range of technical prob¬ 
lems. Many of these require the services of professionals who possess 
abilities in foreign languages or in scientific linguistics. A major pur¬ 
pose of this publication is to draw the attention of interested readers 
outside the Agency to the opportunity of a career with NS A, utilizing 
and enlarging their skills in foreign languages and linguistics. 

Examples of the tasks to be performed include the following: trans¬ 
lation or summarization in English of technical foreign language ma¬ 
terial; transcription of foreign speech, or English summarization; 
preparation of studies, reports, etc. based on material in one or more 
foreign languages; preparation of linguistic aids, such as glossaries or 
more specialized products of language analysis. Other applications 
of linguistic talent may be needed in such areas as speech communi¬ 
cations, computer applications to linguistic tasks and, not least, 
teaching of subjects in the fields of foreign language or linguistics in 
the Agency’s School. Specific assignments may be highly specialized 
and may include uniquely rewarding research projects. 

The articles which follow will serve to show to some extent the variety 
of interests which attract NSA’s linguists. All find their duties chal¬ 
lenging; many consider them quite captivating. 


The Guest Editor for this special issue of the NSA Technical Journal is Dr. 
James R. Nielson, Chief of the Department of Intelligence and Language, Na¬ 
tional Cryptologic School. 
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Handle With Care: Russian Transliteration 


BY A. J. SALEMME 


The translator, who is so often faced with the problem of how to 
translate a specific word into another language, sometimes encounters 
words which do not have to be translated at all. Is it obvious that, 
in this day of live international telecasts, names of people and places, 
for example, are usually left “as is”? Actually, between any pair of 
languages there are various conventions, changing over the years, 
which govern the treatment of names and other borrowed words. 
Accordingly there are various “traditional” treatments which some¬ 
times defy simple logic. English speakers generally say “Rome, 
Naples” (not “Roma, Napoli”), but vacillate between “Leghorn” 
and “Livorno”; read “Louis XVI” as “Louis the Sixteenth” instead 
of the antique trade’s “Louis Seize”; use “Rive Gauche” and “Left 
Bank” almost interchangeably; but seem to prefer “Champs Elys^es, 
Moulin Rouge” to “Elysian Fields, Red Mill.” Germans say 
“Mailand, Venedig” (not “Milan, Venice” or even “Milano, Vene¬ 
zia”), and the French say “Londres” (not “London”). But if the 
translator is aware of the relatively infrequent “traditional” treat¬ 
ments, he is usually safe in retaining “as is” personal and place names, 
as well as nouns and other words for which there is no precise equiva¬ 
lent in the language into which he is translating. “Guillaume, 
Colombey-les-deux-Eglises, sang froid” remain unchanged, instead of 
being translated, or almost-translated, as “William, Colombey-with- 
the-two-churches, cold blood.” “Heinrich, Neustadt, gemiitlich” are 
not “Henry, Newton, nice.” 

Of course, an additional problem arises when such words are bor¬ 
rowed from a language having a different writing system. Since, for 
example, Russian letters (like Japanese, Arabic, Burmese, or other 
non-Latin characters) are often completely unintelligible to the 
average American reader, or, even if they are intelligible, pose typo¬ 
graphical difficulties, the words cannot remain entirely “as is” 1 , but 
have to be transliterated into Latin letters. Thus, HBaH Ky 3 Heu, 0 B, 


Except when some purpose is served by retaining the actual shape of the 
letter, as when an entire mystery novel hinges upon a single clue, such as a hand¬ 
kerchief which presumably belongs to the heroine (“The monogrammed H could 
only stand for ‘Helen”’) but proves on the last page to belong to the completely 
unsuspected Russian ex-countess HaTaJiBH IleTpOBHa. 
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TRANSLITERATION 


Kphboh Por, cojiOHnaK become “Ivan Kuznetsov, Krivoj Rog, solon- 
chak” (but not “John Smith, Crooked Horn, salty soil”) 2 . 

Unfortunately, transliteration is not as simple as sending in a 
quarter to the cereal company and being the first in the neighborhood 
to get a “Magic Message Decoder” with the beautifully one-for-one 
ability to read messages like: 

XI CROOA 

KB BOX 

What, then, are some of the problems making simple one-for-one 
transliteration from Russian difficult? In order to answer that 


2 Translation of personal and place names is by no means impossible, al¬ 
though the results range from the confusing to the ludicrous. Britishers, for 
example, who would be among the last to translate the name of a Frenchman 
‘‘Pierre Boulanger” as “Peter Baker,” apparently do not mind translating the 
first names of everyday Russians (calling them “Constantine, Peter, Nicholas” 
instead of “Konstantin, Petr, Nikolaj”). Perhaps this is an extension of the 
common British treatment of Russian historical names (“John Moneybags” in¬ 
stead of “Ivan Kalita,” “John the Dread” instead of “Ivan IV,” “Ivan Groznyj,” 
or “Ivan the Terrible”). Nowadays, Russian last names are not translated by 
anyone, British or otherwise. True, Russian names often lend themselves to 
plays on words, such as that on “Scratchy” Tsarapkin’s last name, but only a 
crude machine-translation program would actually translate XpymeB as “cock¬ 
chafers” or Bopnc EohkO as “Boris briskly/smartly” (the latter sounds somewhat 
like a Saki character, what?). Nor is mistreatment of place names by trans¬ 
lating them impossible. There have been a few legendary instances such as the 
well-known flub of the German General Staff in misinterpreting a report from an 
agent in Spain to read that Churchill and Roosevelt would be meeting in “the 
White House,” instead of where the man said, in “Casablanca.” But most of 
the incorrect treatments of place names fall into the same internationally less 
critical class as the treatments by those machine-translation programs which, at 
their own risk, chop up into translatable chunks any new words the machine en¬ 
counters. Item: the machine translation of AHTHKHTepa, the Russian spelling 
of Antikythera, a Greek island, as “antiwhaleparadise.” 

3 To the person just learning Russian, the Cyrillic letters may look as weird 
as the characters in the cereal box “codes.” Accordingly he may hope to memo¬ 
rize the alphabet, with or without mnemonic devices like “JI has a hook like an 
L-ephant’s trunk.” His “Magic Message Decoder” easily changes BepJMH and 
JIohjjoh to “Berlin” and “London,” confirming his guess that it’s just one step 
from Russian spelling to English meaning. When napHHv comes out “Parizh,” he 
starts to wonder, and he soon finds out, with HepHOe Mope, that Russian-spell- 
ing-to-English-meaning usually has to go through the step “Russian meaning:” 
“ Chernoe , can that be right? More , what’s that?”) 


X X El 

ioxe: 


XE 
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question it is necessary first to invoke the names of Tschaikowsky, 
Chekhov, and Khrushchev 4 . 

A person who does not know Russian sees no illogicality in the 
spellings of the familiar names Tschaikowsky and Chekhov. He 
simply assumes that the conventional spellings in Latin letters reflect 
some strange Russian spellings. He is so used to these conventional 
spellings that he rejects out of hand the suggestion that, since both 
Russian names begin with the Russian letter H one could easily throw 
away the Ts in Tschaikowsky with no loss 5 . No, he feels, Chaikowsky 
and Chajkovskij “look funny” 6 but Tschaikowsky “looks right.” So 
does Chekhov, and so does Rachmaninoff. Even though the w in 
Tschaikowsky, the v in Chekhov, and the ff in Rachmaninoff all rep¬ 
resent the same letter b. And even though the kh in Chekhov and the 
ch in Rachmaninoff represent the same Russian letter x 7 . 

Are all those variations completely illogical? Actually, the person 
who knows Russian realizes that the differences in the Latinized spell¬ 
ings of Russian names on record albums and on book covers are 
usually the result of the fact that some Russian names might become 
better known in an “English” spelling (Kutuzov, Lobachevsky, 
Shostakovich), others in a “French” spelling (Koussevitzky, Mous- 
sorgsky, Chaliapin, Scriabine), and still others in a “German” spelling 
(Rachmaninoff, Tschaikowsky, Gabrilowitsch). These differences 
also explain the situation—often baffling to a person who does not 
know Russian—in which a person with the last name Vasilev has a 


4 These names are apparently as ritualistically vital at the start of a discus¬ 
sion of Russian transliteration problems as the name of the Almighty is, when 
used to tag the last sentence of a political speech at an American nominating 
convention. 

5 One of my music-loving friends feels that, while we’re at it, we might just 
as easily throw away all of Tschaikowsky’s music with no loss either, but that is 
a matter beyond the scope of this article. 

6 So does the spelling of the identically pronounced Polish name Czajkowski. 
But since Polish is written in Latin letters, there is no question of choosing among 
various transliterations. 

7 Incidentally, the Russian letter X, used by itself, is not equivalent to Latin 
X in the meaning “unknown.” Instead, the Russian letter H is used in that 
sense (e.g., noJiKOBHHK H. = “Colonel X.”) and the Russian letter X connotes a 
not too acceptable Russian noun, and its various affectionate, diminutive, aug¬ 
mentative, pejorative, and other derivative forms. Does that explain why 
Russian-speaking Americans, seeing references in the press to President Kennedy 
and Nikita Khrushchev as “the two K’s,” used to suggest calling the latter 
“Comrade X.,” and then grin at one another? Or why sweet young things who 
are asked to translate Russian texts likely to contain crude expressions are ad¬ 
vised, “If you can’t find it in the dictionary and it begins with x, don’t ask!” 
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TRANSLITERATION 


brother with what looks like the completely different last name Wassil- 
jew. 

The examples in the preceding paragraphs show how the process of 
representing Russian in the various languages written in Latin letters 
is complicated because the individual languages have their own spell¬ 
ing conventions (Russian y is usually transliterated into English as u , 
but into French as ou). In addition, inconsistencies develop between 
transliteration (the representation of a word in a different language by 
means of an alphabetic, letter-by-letter conversion) and transcription 
(the representation of a word in a different language by means of a 
more or less phonetic, sound-by-sound conversion). Cf. IIoTeMKHH = 
Potemkin or Potyomkin. The process of transcription, in its turn, 
is often complicated by the lack of completely equivalent sounds in 
the various languages (e.g., the Russian letters x and bi which represent 
sounds that do not occur in English) and by subjective interpreta¬ 
tions and preferences. 

Inconsistencies in the representation of Russian words have a long 
history. The name “Moscow,” for example, is a partial translitera¬ 
tion of MocKBa, but the seventeenth-century name “Muscovy” is 
obviously a transcription, as attested to by the accurate representa¬ 
tion, in the first syllable, of the feature of anaHBe (pronunciation of 
unaccented 0’s as A’s) typical, then as now, of the Moscow area. 

Once established, these inconsistencies are hard to change, much 
less to “correct.” By now, English-speaking people are used to the 
spelling Moscow, rather than Moskva, Maskva, Muskva, or Muscow. 
They are also used to seeing the name Tschaikowsky in the German 
transliteration, but would probably not recognize it as listed in the 
Library of Congress files (ChaikovskiT), just as they become used to 
the spelling Khrushchev but would probably not have been able to 
recognize the German spelling Chruschtschew. 

Conversely, most Russians, long accustomed, from reading trans¬ 
lations of James Fenimore Cooper’s works, to the letter-by-letter 
transliteration TOMaraBK (“tomahawk”), would not recognize a more 
accurate, but still less than perfect, transcription 8 . 

The average translator of Russian is familiar enough with some 
other European language to understand why the name IlymKHH, for 
example, assumes the following forms in the various European lan¬ 
guages: 


8 For one thing, what does Russian do with the sound represented by English 

h, a sound which is not present in standard Russian? 
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Pushkin 

Pouchkine 

Pusckin 

Puschkin 


English 

French 

Italian 

German 


Poes j kin 
Puskin 
Puszkin 
PuSkin 


Dutch 

Hungarian 

Polish 

Czech 


Accordingly, he can usually explain, in terms of transliteration or 
transcription from Russian into English, French, or German, the 
seeming inconsistencies in the spelling of Russian words in English 
dictionaries such as the Webster’s Third New International Dic¬ 
tionary 9 . Some of these words are cited in Webster’s in one spelling 
only. Usually this is a spelling which corresponds, more or less, to 
ordinary English spelling conventions 10 , e.g., 


apparat 

annapaT 

Krivoi Rog 

Kphboh Por 

apparatchik 

annapaT^HK 

Makeevka 

ManeeBKa 

artel 

apTejit 

Nikolaev 

HnKOJiaeB 

babushka 

6a6yniKa 

oblast 

o6jiacTB 

Balaclava 

BajiaKJiaBa 

samogonka 

caMoroHKa 

balalaika 

6ajiajiaHKa 

samovar 

caMOBap 

chastushka 

^acTyinKa 

Samoyed 

CaMoeA 

Chechen 

HeneH 

Saratov 

CapaTOB 

Cheka 

Hena 

solonchak 

COJIOHHaK 

Chernovtsy 

HepHOBIJBI 

sputnik 

cnyTHHK 

chernozem 

^epHoaeM 

taiga 

Tanra 

Ivan 

HBaH 

troika 

TpoiiKa 

kasha 

Kama 

tundra 

TyHApa 

Kharkov 

XaptKOB 

Ulan-Ude 

yjiaH-YAs 

Khlyst 

Xjibict 

vodka 

BOAKa 

Kiev 

KneB 

Yaroslavl 

HpocjiaBjib 

kolkhoznik 

K0JIX03HHK 

Yenisei 

EHHceit 

Komi 

Komh 

Zhdanov 

£KAaHOB 

Kremlin 

KpeMJIB 




9 Prerevolutionary and Soviet proper names and other Russian words have, 
for the most part, been borrowed directly from the Russian or through French 
and German. An occasional Russian word may have been borrowed through a 
European language other than the three mentioned (e.g., “czar,” borrowed 
through the Polish spelling). But so far as I know, there is no large body of 
Russian words in Webster’s spelled, say, in Dutch or Hungarian. I do, however, 
have a phonograph record cover, written in English, which lists Russian song 
titles such as “Nje rewnoej” and “Sjewodnja La Loebloe.” These titles remain 
unintelligible until one reads the comment that the record was made by Russian 
emigres now living in the Netherlands. Then it is easy to recognize the (some¬ 
times garbled) Dutch spellings of Russian words. 

10 In these conventional English spellings, H and it, and sometimes BI, are 
rendered as i, e is rendered as e or ye, K is sometimes rendered as c, x and JK are 
rendered as kh and zh , and B is ignored. 
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Occasionally the single spelling in Webster’s may be in accordance 
with French transliteration or may include an attempt to represent a 
palatalized Russian consonant, e.g., 

knout 11 KHyT 

Tchebycheff inequality TeGbimeB 

soviet coBeT 

But usually, in a spirit of “you-pays-your-money,” Webster’s lists 
two or more spellings per Russian word. Some of these sets of vari¬ 
ants are understandable as alternate English/French/German spell¬ 
ings, but others are understandable only as various ways, traditional 
or otherwise, of getting the word into something like intelligible 
English. Sometimes Webster’s throws all the variants of a Russian 
word into one steaming pot from which, on the end of a long stick, 
the user is supposed to fish out his preference. Although Webster’s 
gives the definition under its preferred form (the first word in each set 
of variants below), that preference is not always consistent. E.g., 


Archangel, Arkhangelsk 
Belorussian, Bielorussian, Byelorussian 
beluga, belouga 

borsch, borscht, borsht, bortsch, borshch 
caftan, kaftan 
Cheremis, Cheremiss 

chervonets, chervonetz, tchervonets, tchervonetz 

chetvert, tchetvert 

chevkinite, tscheffkinite 

guberniya, gubernia 

intelligentsia, intelligentzia, 

jarovize, iarovize, yarovize 

Kirghiz, Kirghese, Kirghis, Kirgiz, Khirghiz 1 * 

kok-saghyz, kok-sagyz 

kolkhoz, kolkoz, kolkhos 


ApxaHrejibCK 

EejiopyccKHH 

6ejiyra 

6opm 

Ka$TaH 

HepeMnc 

nepBOHen 

MeTBepTt 

HeBKHHHT 

ry6epHHH 

HHTejurareHUHH 

HpOBH3HpOBaTb 

KnprH3 

KOK-carbi3 

K0JIX03 


“Pronounced /'naut, nut/. The entry “knut” /(kd)'ndt/ is something else 
entirely (“a fop of the late 19th and early 20th century”). 

12 So where’s “Khirhghizh”? 
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Korsakoff’s psychosis, Korsakow’s psychosis 13 

KopcaKOB 

koumiss, kumiss, kumys, kumyss 

KyMbIC 

krym-saghyz, krim-saghyz 

KpbiM-carbi3 

muzhik, muzjik, mujik, moujik 

My?KHK 

piroshki, pirojki 

IIHpO/KKH 

politburo, politbureau 

nojiHT6iopo 

polynya, polynia 

nOJIbIHbH 

sastruga, zastruga 

3acTpyra 

Sebastopol goose 

CeBacTonojib 

Sevastopol 

CeBacTonojib 

shashlik, shashlick, shaslick 

UiaiHJIblK 

solonetz, solonets 

cojioHeu; 

sovkhoz, sovkhos, sovhoz 

C0BX03 

technicum, technikum 

TeXHHKyM 

tovarich, tovarish 

TOBapum 

ukase, ukaz 

yna3 

uvarovite, ouvarovite, uwarowite 

yBapoBHT 

Uzbek, Uzbeg, Uzbak, Usbek, Usbeg 

Y36eK 

Yerevan, Erevan, Erivan 

EpeBaH 

zemmi, zemni 

3eMHbIH 


The inconsistencies in the English spelling of Russian words in 
Webster’s could actually be worse, since there are at least 15 Russian 
transliteration systems currently in use at various governmental and 
private institutions, libraries, research projects, etc., in the United 
States, Great Britain, France, and Germany alone. The following 
table shows a few of the most common ones. It could easily have 
been made even more complex by including transliteration systems 
used at individual libraries such as the New York Public Library, and 
individual erratic transliterations based on the pronunciation, rather 
than the spelling (e.g., “e after consonant = ye, as in HeT = nyet; or 
kc = x, as in AjieKceii = Alexei). The dashes in the table indicate 
that the transliteration in the particular system is the same as in the 
BGN system. 


13 “After Sergei Korsakov.” 
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TRANSLITERATION 


BGN a 

LC b 

NSA C 

AATSEEL d 

MT e 

French f 

German 1 ? 

a 

a 

- 

- 

- 

- 

- 

- 

6 

b 

- 

- 

- 

- 

- 

- 

B 

V 

- 

- 

- 

- 

v, ff h 

w, ff ‘ 

r 

g 

- 

- 

- 

- 

g, gu', ghi 

- 

A 

d 

- 

- 

- 

- 

- 

- 

e 

e, ye k 

e 1 

e 1 

e 

e 1 

e, ie 

e, je 

e 

e, ye k 

e 1 

e 1 

e 

e 1 

yo 

jo 

7K 

zh 

- 

- 

z 

j 

j, g 

sh 

3 

z 

- 

- 

- 

- 

- 

z, s 

H 

i 

- 

- 

- 

- 

i, i m 

- 

H 

y 

T n 

j 

j 

1 

i 

j 

K 

k 

- 

- 

- 

- 

k, c 

- 

JI 

l 

- 

- 

- 

- 

- 

- 

M 

m 

- 

- 

- 

- 

- 

- 

H 

n 

- 

- 

- 

- 

n, ne° 

- 

0 

o 

- 

- 

- 

- 

- 

- 

n 

P 

- 

- 

- 

- 

- 

- 

P 

r 

- 

- 

- 

- 

- 

- 

c 

s 

- 

- 

- 

- 

s, ss p 

s, ss q 

T 

t 

- 

- 

- 

- 

- 

- 

y 

u 

- 

- 

- 

- 

ou 

- 

4> 

f 

- 

- 

- 

- 

- 

- 

X 

kh 

- 

- 

X 

X 

- 

ch 

a 

ts 

ts n 

- 

c 

q 

- 

z 


ch 

- 

- 

c 

c 

tch 

tsch 

in 

sh 

- 

- 

s 

w 

ch 

sch 

iA 

shch 

- 

- 

w 

sc 

5 

chtch 

schtsch 

Tb 

H 

- 

- 

- 

7 

y 

y 

BI 

y 

- 

- 

- 

- 

_r 

- 

B 

/ 

- 

- 


6 

y a 

j 

3 

e 

e 

eh 

e 

3 

eh 

e 

K) 

yu 

iu n 

/ 

— 

ju 

h 

you, iou 

ju 

H 

ya 

ia n 

- 

ja 

4 

ya, ia 

ja 


a U. S. Board on Geographic Names. Most U. S. government 
agencies adhere to the ruling given in the GPO Style Manual that 
4 ‘The spelling of geographic names must conform to the decisions of 
the U. S. Board on Geographic Names.” For purposes of consist¬ 
ency, those agencies also use the BGN system for transliterating Rus¬ 
sian words other than geographic names. 

b Library of Congress. This is one government agency which does 
not use the BGN transliteration system. 
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c National Security Agency. Like the Library of Congress, the 
National Security Agency is authorized to use its own Russian-to 
English transliteration system. NSA does, however, use the LC sys 
tern for library accessioning. 

d American Association of Teachers of Slavic and East Europear 
Languages. System based on Czech alphabet. 

e System devised for machine-translation (MT) research by George 
town University and subsequently adopted by IBM for the sann 
purpose. Each Russian alphabetic symbol is unambiguously repre 
sented by a single Latin letter or arabic numeral. (Ambiguity be 
tween use of arabic numeral to represent a Russian alphabetic symbo 
and use as an actual numeral is reconciled by assigning a code prefi: 
to actual numerals.) 

Combination of common Russian-to-French transliteration sys 
terns. 

Combination of common Russian-to-German transliteration sys 
terns. 

h When final (e.g., YpycoB, Ouroussoff). 

When final (e.g., KucejieB, Kisselyoff). 

When followed by e or h (e.g., TypreHeB, Tourgueneff, flarmieE 
Diaghileff). 

When preceded by consonant. Otherwise (initial, or when pre 
ceded by vowel, t> or b) ye, ye. e and s indistinguishable in translitera 
tion when preceded by consonant. 

1 Always e, regardless of whether initial or preceded by vowel, con 
sonant, tb or b; no distinction in transliteration between e and € 
m When preceded by o (e.g., ctoht, stoYt); hh = y. 
n The “modified” LC system omits diacritics; i, ts , iu, ia respectively 
When final and preceded by vowel (e.g., JlemiH, Lenine). 
p Between vowels (e.g., KucejieB, Kisselyoff). 

^Between vowels (e.g., TepacHMOB, Gerassimoff) or initial (Comoe 
Ssomoff). 
r bm = y. 

transliterated when followed by vowel (e.g., CaBejibeB, Savelyeff) 
ignored when followed by consonant or at end of word (e.g., Kojibijoe 
Torojib, Koltsoff, Gogol). 

With so many transliteration systems in existence, it obviously i 
not a question of which one is “right.” It's analogous to decidin 
which is the “right” color to repaint the living room walls. First o 
all, what’s wrong with the present color? Then, if they really have t 
be painted, wouldn’t one of the premixed colors at the paint store b 
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close enough to what is desired? If not, how much does it cost to 
have colors custom-blended? Would changing the wall color require 
buying new carpets and new furniture for the living room? And the 
dining room? Would buying a new dining room set necessitate en¬ 
larging the dining room? Wouldn’t knocking out the dining room 
wall mean chopping down the flowering crab just when it was begin¬ 
ning to look so pretty? Wouldn’t buying a new fully-grown replace¬ 
ment from the nursery be better than buying one of those 2-foot 
sticks they sell at Safeway? Anyone who knows how these discus¬ 
sions usually end (“Let’s just leave the walls seasick green!”) can 
imagine how the Library of Congress, with its transliteration system 
nicely worked out and its Russian accessions neatly catalogued for a 
generation, felt when, toward the turn of the century, it became the 
law of the land that spellings of geographic names in U. S. government 
publications would be in accordance with decisions of the U. S. Board 
on Geographic Names. How many file cards to change? How 
many man-hours to train new people to change them? How many 
months to change how many cards? How many walls to knock out? 
... Is it really surprising that, 60 years after the decision to standard¬ 
ize the spelling of geographic names in U. S. government publications, 
the Library of Congress lists Soviet place names at two separate 
locations in the catalogue when the LC transliteration differs from 
the BGN? 14 

That arrangement led to an amicable settlement between BGN 
and LC, so that nowadays one seldom hears remarks like “You and 
your crummy diacritical marks!” and “Oh yeah! Well, what about 
your crummy ‘y’ that you throw in when you can’t think of anything 
better to do!” It also united LC and BGN spiritually when, still 
another generation later, another Johnny-come-lately government 
agency said, “A pox on both your transliteration systems!” And 
here we stand, in the Computer Age, with Russian words spelled 
every which way so that the poor dumb machine doesn’t know which 
way to turn. And the walls are still painted that same seasick green. 

One of the few virtues of strict adherence to any transliteration sys¬ 
tem is that it obviates the translator’s necessity to cope with un¬ 
realistic rules like “If a Russian place name already has a standard¬ 
ized English spelling, that spelling should be substituted for the 


14 The^difference between Pervomaisk (LC) and Pervomaysk (BGN) or be¬ 
tween U1 ianovskoe (LC) and Ul’yanovskoye (BGN) is perhaps slight enough 
not to confuse or inconvenience the user of the catalogue, but there is a very 
large number of catalogue cards and a considerable number of catalogu^-user/feet 
between the spellings Erevan (LC) and Yerevan (BGN), or between LAYlo (LC) 
and Yaylo (BGN). 
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transliteration.” Instead he can systematically ignore all such un¬ 
systematic “standardizations” 15 . Thus, 

BGN NSA But not 


ApxaHrejitcK Arkhangelsk 

AcTpaxaHb Astrakhan’ 

TopbKHH Gor’kiy 

(Cf. TopKH, Gorki) 

KyitobimeB Kuybyshev 

CeBacTonojib Sevastopol’ 


Arkhangelsk 

Astrakhan’ 

Gor’kij 

Kujbyshev 

Sevastopol’ 


Archangel 

Astrakhan 

Gorki 

Kuibyshev 

Sebastopol 


Unlike transliteration, which is the attempt to change the rendition 
of a word from one writing system to another writing system, trans¬ 
cription is the attempt to render in a particular writing system not 
the way a word looks, but the way it sounds 16 . Thus, transcription 
need not even involve a second language. It may be unscientific 
(e.g., thUs, tranz-KRIP-shun, LANG-uage), semiscientific (thus, tran- 
skrYp'-shim, lang'-gwYj), or scientific (dAs, traenskrip{n, lae^gwldj). 

The problem of transcription (or “phonetic transcription”) is made 
more complicated when a word from one language is being transcribed 
into another, since it is a rare phoneme that is exactly the same in any 
two languages. Obviously the solution that is closest to the ideal is 
a scientific transcription system that renders unambiguously all the 
meaningful sounds of all languages, not excluding Hottentot clicks or 
the Russian bi. This system, like the All-Purpose Universal Interna¬ 
tional Transliteration Scheme, will probably never be achieved in 
this world. In the meantime the IPA (International Phonetic Asso¬ 
ciation) system is as good as any. According to this system a French 
or Russian linguist will recognize in /&. as/, /traenskrip/n/ and /lae^gwidj 
the English words “thus,” “transcription,” and “language,” and an 
American linguist will recognize in ,/alQr/, /trdskripsjo/, and /lagaij/ 
the French words “alors,” “transcription,” and “langage,” and will 
recognize in /tak/, /transkripcijd/, and /jdzfk/ the Russian words 
Tan, TpaHCKpmm,HH and H3biK. 

But words can also be transcribed from one language to another 
according to a less than scientific system. They often are, in fact, 
not only because typewriters lack the necessary diacritical marks and 
because print shops lack the necessary type fonts, but mostly because 
nonlinguists are scared away from strange-looking symbols. The 
English words “thus,” “transcription,” and “language” spelled in a 


15 Except that everyone agrees that, despite the rules, it would be ridiculous 
to treat MoCKBa and KaeB as anything other than “Moscow” and “Kiev.” 

16 This is comparable to English shorthand or stenotypy, when, for example, 
“cats” and “Katz” are written with the same symbols. 
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system that is “easy-to-pronounce” for a Frenchman might look like 
“zasse,” “tranne-SCRIPPE-chonne,” “LINGUE-gouadzhe,” and 
spelled to be “easy-to-pronounce” for a Russian might look like 3ac, 
TpsHCKpHimm, jisHrysjjjK. Certainly there is little need to remind the 
reader that he has seen similar systems for making French and Rus¬ 
sian “easy-to-pronounce” for the English-speaking student: “ah- 
LOHR,” ‘‘ trahn-screep-see-OHN,’’ lawn-GAHZHE 9 ’; “tahk,” “trahn- 
SKREEP-tsee-yah,” “yah-ZOOIK.” 

The major problem that transcription poses in technical translation 
is recognizing when a non-Russian place name or personal name has 
been transcribed (in one system or another) instead of being trans¬ 
literated (in one system or another). There being no logical pattern 
for treatment of foreign words borrowed from one language into 
another, non-Russian place names or personal names sometimes have 
two or more accepted variants in Russian. One variant might be a 
letter-by-letter transliteration that takes no consideration of the 
actual phonemes in the original language. By this method, Newton, 
Shakespeare, Shaw, and Harrow would be rendered HeBTOH, CxaKs- 
cnsaps, CxaB, TappoB 17 . The other variant might be an attempt at a 
phonetic equivalent, by which method the same English names would 
be rendered Hbiotoh, nieKcimp, IHoy, Xsppoy. Or the German city 
Hildesheim might be rendered as rHjiB^ecreiiM or XmibAecxaftM. 

Letter-by-letter transliterations such as HeBTOH are typical of an 
age when international communications were conducted mostly by 
means of the written word and French was the language of diplomacy. 
Transcriptions such as Hbiotoh are typical of the present age of elec¬ 
tronic communications allowing the transmission of the actual pro¬ 
nunciation of the name or other word to be assimilated into another 
language. True, these pronunciations are often ignored by large 
segments of the population. The power of the printed word is still 
so great in determining the generally accepted pronunciation of 
foreign names and other words that one almost wishes that cnyTHHK 
were assimilated into English in the spelling “spootnik” (pronuncia¬ 
tion used by one news announcer for every hundred that use a pronun¬ 
ciation rhyming with “nut pick”), or that XpymeB had been rendered 
in the American press as “Khrushchoff” (which would at least have 
lessened the frequency with which one heard “Crew Chef”). 

At any rate, when foreign place names and personal names are 
cited in the daily press or in internationally oriented scientific litera¬ 
ture, they often assume a form midway between transliteration and 

17 You and I know that these spellings look ridiculous, but apparently Rus¬ 
sians don’t. A Russian-language newspaper published in New York, for example, 
used to run ads for the AppOB (‘‘Arrow”) farm. 
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transcription. For that reason non-Soviet place names and personal 
names appearing in Russian text can only rarely be transliterated by 
the same letter-by-letter procedure that is used for Soviet names. 
Pe 3 ep$op a is not “Rezerford,” but “Rutherford.” ToMejib is “Gomel” 
but Mapcejib is “Marseilles.” Eohijob is “Bojtsov” but KpomteHren- 
Mep is “Kreuzenheimer.” 

If the translator of technical Russian encounters in Russian text 
place names in a language with which he is unfamiliar, he can usually 
find them on a map printed in Latin letters. By comparing that 
map with one printed in Cyrillic characters, he can discover, almost 
involuntarily, that Russian Y, for example, is usually equivalent to 
French “ou” or Dutch “oe” (e.g., Tyjiy3, Fyc, Toulouse, Goes). Maps 
can also be of great use in determining how to render non-Soviet per¬ 
sonal names encountered in Russian text. By comparing Soviet and 
American maps of France, for example, one can easily construct a 
Russian-to-French conversion chart that can be of partial assistance 
in deciphering Russian transcriptions of French personal names 18 . A 


ai 

BoAJiep 

Baudelaire 

ee JX oope 

Daubree 

ai 

JleMeTp 

Lemaitre 

ei KopHeJib 

Corneille 

ais 

BoMapme 

Beaumarchais 

er 0?Ke 

Auger 

aix 

: CaMne 

Sampaix 

es JfernaH 

Deschamps 

ay 

Kop^e 

Corday 

et En3e 

Bizet 

e 

Jlenam 

Lepage 

eu IleKKep 

Pecqueur 

e 

Aoejiap 

Abelard 

eui BepHeJib 

Verneuil 

e 

AMnep 

Ampere 

ez JIiOMypbe 

Dumouriez 

e 

AHryjieM 

Angouleme 




conversion chart that would be more reliable than one reflecting the 
vagaries of French spelling might be one prepared from a comparison 
of Soviet and American maps of China, containing such equivalent 
values as w k = ch (Wade-Giles transcription), h = clT, hh = ing, 
HHb = in. The values contained in such a chart could then be used 
both for place names (Tjkshahh = Chengting, not Chzhendin), and 
for personal names T^Koy 9Hb-jiafi = Chou En-lai, not Chzhou Ehn'- 
laj). 

By way of an exception, in order to facilitate library searching, 
non-Soviet personal names should be both transliterated and trans¬ 
cribed when they occur in Russian-language bibliographies. E.g., 

18 I’m afraid that this procedure is not as simple as it sounds. To get an idea 
of how complicated it is to convert Russian spellings of French names back into 
the original spellings, one need consider only the Russian vowel e and some (I 
have been fooled too many times to say “all”) of the spellings it represents in 
French names. 
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Chzhou Ehn'-laj [Chou En-lai], “Speech at the Third Session of 
. . . Narodnyj Kitaj [People’s China], . . . 

Uitmen, Uolt [Whitman, Walt], Lisfya travy [Leaves of Grass], 
Moscow, 1955. 

Names like Tncoy SHb-jiaii are instantly recognizable as being non- 
Soviet. For one thing, where’s the -obhh? But what if the name is 
that of a less well-known personality? Even then the translator will 
often have a clue, because usually the presence of two initials before 
a last name in Russian text indicates that the person referred to is or 
was a Russian citizen, e.g., H. C. XpymeB, A. II. Tcxob. But usu¬ 
ally the presence of a single initial before a last name indicates a non- 
Russian, e.g., III. Ae Tojuib. JL 9ii3eHxay3p. 

While this is not an invariable rule, it is observed often enough to 
serve the translator as a general guide for the treatment of personal 
names. Russian names (pre- and postrevolutionary) are translite¬ 
rated in accordance with the preferred transliteration table. But 
non-Soviet names should be returned to their original spellings in 
Latin letters or, for non-European languages, to a more or less stan¬ 
dardized spelling in Latin letters. 

Hence the difference in treatment between Soviet names like: 


r. n. rojibu; 

A. M. neiHKOBCKHH 
M. E. Tep-MimacHH 

and names like: 

Not 


G. P. Gol’ts 
A. M. Peshkovskij 
M. E. Ter-Minasyan 


But 


A. Tojiba 

B. HIhhikobckhh 
T. Kaii-uiH 

C. HocnAa 


A. Gol’ts 

B. Shishkovskij 
Ch. Kaj-shi 

S. Iosida 


A. Holz 

B. Szyszkowski 
Chiang Kai-shek 
S. Yoshida 


(German) 

(Polish) 

(Chinese) 

(Japanese) 


Names of well-known world personalities usually offer no difficulty 
to the translator, who automatically converts the spelling Ae Tojuib 
for example, to “De Gaulle,” and, recognizing in the letter III. the 
French pronunciation of the digraph “ch,” transcribes it as “C.” in¬ 
stead of transliterating it as “Sh.” 

But when less well-known persons are mentioned in Russian text, 
the translator often has to do a certain amount of searching in biblio¬ 
graphical or other sources to find the actual spelling. If this search 
is unsuccessful, it is better for him to transliterate the name than to 
guess. The treatment “Dzh. Kelli [as in text]” is better than the 
bland assumption “J. Kelly” (since “G. Kelley” is as valid a guess). 
Another acceptable treatment for a name which the translator has 


14 




A. J. SALEMME 



been unable to locate in its original spelling is “Dzh. Kelli [retrans¬ 
literated from Russian]” 19 . 

Incidentally, in at least two instances the initials of Soviet citizens 
require special treatment in translation. The first is when an author 
supplies his initials to indicate that he has added his own emphasis 
to a cited quotation. An article by K. I. Bakhman, for example, 
contains the sentence, “A. A. Pe^opMaTCKmi yTBep^KAaeT, hto rpaMMa- 

THKa JIK)6orO H3bIKa MOHveT BbipaHvaTbCH TOJIbKO 3 THMH C II O - 

c o 6 a m h (pa 3 paAKa moh - - K.B.)” The author’s initials after such 
expressions as pa3pnAKa moh (“my spacing”) and noA^epKHyTO mhoh 
(“my italics,” “italics supplied”) should be suppressed in the Eng¬ 
lish translation 20 . 

The second instance is when an author refers to himself in a context 
that balances delicately between the formal and the familiar. For 
example, the name of the author of an article written by a kinder¬ 
garten teacher was given in a Soviet journal, in the ordinary formal 
manner, as A. K. Matveeva. In the article the author quoted sev¬ 
eral bits of conversations with the children, in which references were 
made to the cook and other persons by the familiar form of address 
with the first name and patronymic. But, describing another situa¬ 
tion, the author quoted one child who began a question politely, 
H 3 BHHHTe, A. K., ho . ..” Obviously this creates a problem for the 
translator. He cannot guess what first name and patronymic the 
child actually used (Anna Konstantinovna? Aleksandra Kuz’mini- 
chna?). Neither can he translate it as, “Excuse me, A. K., but . . .” 
since that might create the impression that Russian kindergarten 
children speak like Madison Avenuers. The best he can do is trans¬ 
late the polite beginning as “Excuse me, Miss Matveeva, but . . .” 
(ignoring the possibility that she might be Mrs. Matveeva). 

Personal names also occur occasionally as part of the names of 
Soviet institutions, which are named after people, famous or other¬ 
wise, by adding the word HMeHH (“name of”) and the person’s name 
in the genitive case to the basic name of the institution (e.g., Mockob- 
ckhh rocyAapcTBeHHbiH yHHBepeiiTeT hmchh M. B. JIoMOHOcoBa). The 

19 This treatment is analogous to the treatment of translations into Russian 
of excerpts from books published originally in a different language, say French. 
A translation from Russian into English might look different from a translation 
from the original French into English. Similarly, a back translation of a Rus¬ 
sian excerpt from an English-language source might look very different from the 
original English. If the translator can locate the original source of such excerpts, 
he can act appropriately. But if he cannot locate the source, it is desirable to 
indicate “[retranslated from Russian]”. 

20 Note to the young translator: Enjoy this sensation. It isn’t often you’re 
allowed to, much less advised to, throw anything away. 
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translation treatment of such institutional names varies according to 
editorial preference. Some editors, attempting to achieve a form 
similar to that of English (“Walter Reed Army Hospital”), prefer 
treatments such as “M. V. Lomonosov Moscow State University.” 
Others prefer keeping the same word order as in Russian, and either 
transliterating the word HMeHH or rendering it by the arbitrary abbre¬ 
viation “i/n” (for “in name of”): “Moscow State University imeni 
(or: i/n) M. V. Lomonosov.” 

Some editors object that moving the person’s name to the front 
when translating long institutional names (treating, for example, 
Mockobckhh xyAOJKecTBeHHbiii aKajjeMHuecKHH TeaTp CCCP HMeHH M. 
TopbKoro as “M. Gor’kij Moscow Art Academy Theater of the USSR”) 
produces names reminiscent of the “New Ashmolean Marching Society 
and Students Conservatory Band;” But what’s so wrong with that? 
One achieves, in fact, a certain military snap by translating KpacHo- 
3HaMeHHaH, op^eHa JleHima h opjjeHa CyBopoBa 1-h cTenemi BoeHHaa 
anaAeMHa HMeHH M. B. OpyH3e as “M. V. Frunze Red Banner Order of 
Lenin and First-Degree Order of Suvorov Military Academy.” (Al¬ 
though I’d hate to write the lyrics, in any language, for that academy’s 
school song.) 

Whether or not the translator moves the person’s name to the front 
(check with your friendly editor), he should always remember to 
change any inflected form from the genitive case to the nominative. 
In addition, if a Soviet institution is named after a non-Russian, he 
should remember to spell the name appropriately, instead of trans¬ 
literating it: Mockobckhh neAarornuecKHH HHCTHTyT hmchh Kapjia 
JIn6KHexTa, “Moscow Pedagogical Institute imeni Karl Liebknecht.” 

In addition to Soviet and non-Soviet personal and place names, 
certain other classes of words have to be transliterated instead of 
translated. These include various types of abbreviations, the names 
of certain Soviet chemical compounds, medicines, synthetic fibers, 
etc. The translation problems that these words pose could easily fill 
another article. Until that article hits the stands, here are a couple of 
teasers. When is CCCP not the well-known abbreviation such as 
seen, among other places, on the sweatshirts of Russian lady shot- 
putters? If xjiop is “chlorine,” what’s xjiophh? Choose one: The 
preferred rendition of MnapceHOJi (a Soviet antisyphilitic compound) 
as “miarsenol” instead of “myarsenol” (a) is (b) is not coincidental to 
the fact that the word was not formed by mimicking a Cockney de¬ 
scribing the ravages of the disease? 
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Mokusatsu: One Word, Two Lessons 


BY HARRY G. ROSENBL'UH 


Sjc xrv 

mokusatsu #».£ TX—suru, v. take no notice of; treat 
(anything) with silent contempt; ignore [by keeping 
silence 1; remain in a wise and masterly inactivity. 

—Kenkyusha’s New Japanese - English Dictionary, 
p. 1129. 

The story of how an ill-chosen translation of the Japanese word 
mokusatsu led to the United States decision to drop the world’s 
first atomic bomb on Hiroshima is well known to many linguists. 
But perhaps it would not be amiss to retell it briefly just in case 
some reader of this essay is unfamiliar with the word—and in the 
hope that readers may be inspired to avoid the two tragic linguistic 
errors that the story points up. 

In July of 1945 allied leaders meeting in Potsdam submitted a 
stiffly-worded declaration of surrender terms and waited anxiously 
for the Japanese reply. The terms had included a statement to the 
effect that any negative answer would invite “prompt and utter 
destruction.” Truman, Churchill, Stalin, and Chiang Kai-Shek 
stated that they hoped that Japan would agree to surrender un¬ 
conditionally and prevent devastation of the Japanese homeland 
and that they patiently awaited Japan’s answer. 

Reporters in Tokyo questioned Japanese Premier Kantaro Suzuki 
about his government’s reaction to the Potsdam Declaration. Since 
no formal decision had been reached at the time, Suzuki, falling 
back on the politician’s old standby answer to reporters, replied that 
he was withholding comment. He used the Japanese word mokusatsu , 
derived from the word for “silence.” As can be seen from the dic¬ 
tionary entry quoted at the beginning of this essay, however, the 
word has other meanings quite different from that intended by 
Suzuki. Alas, international news agencies saw fit to tell the world 
that in the eyes of the Japanese government the ultimatum was 
“not worthy of comment.” U. S. officials, angered by the tone of 
Suzuki’s statement and obviously seeing it as another typical ex¬ 
ample of the fanatical Banzai and Kamikaze spirit, decided on 
stern measures. Within ten days the decision was made to drop the 
atomic bomb, the bomb was dropped, and Hiroshima was leveled, [l] 
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Almost without exception, whenever this story is told, mention 
is made of the poor translation job. One short magazine article [2] 
calls it “The World’s Most Tragic Translation,” “the ill-chosen 
translation of a common Japanese word,” “disastrous oversight in 
this most important of all messages,” and “that inauspicious trans¬ 
lation.” Indeed, there seems to be little question about the trans¬ 
lator’s culpability. 

Many people, especially non-linguists, seem to feel that every 
word in one language has an exact counterpart, a perfectly equiv¬ 
alent match, in every other language. So, given a word in Language 
A, this can mean only one thing in Language B, and that one thing 
will be exactly what was meant in Language A. Obviously, this is 
not true. Different peoples with different cultural backgrounds view 
things differently and their languages reflect this difference of view¬ 
points. For example, given six animals common to several regions, 
one language group may categorize the animals into two classes 
because of size and have only two words in their language (one for 
large animals, one for small animals); another people may use the 
animals’ eating habits as their criterion and also have two words 
(one for carnivores, one for herbivores) encompassing different groups 
of animals. Another people may subdivide the animals by color and 
end up with four words in their language, and yet another people 
may not do any subdividing at all; so they’ll have six words to use in 
talking about these animals. It’s also possible that some other group 
may have separate names for the male and female of the species (as 
English does for ram and ewe, gander and goose, etc.), so that they 
will have twelve different words! In addition, there are other criteria 
that could be used, and the number of words could be increased or 
decreased; or several languages might have the same number of 
words but, using completely different criteria for their subdivision, 
they would concern completely different things. 

Another linguistic problem that keeps every word in one language 
from having a counterpart in every other language is that often 
something which is commonplace to speakers of one language will 
be totally unknown to speakers of another tongue. They have no 
concept of the thing; so how can they have a word for it? This is a 
problem frequently encountered by Bible translators. How, for ex¬ 
ample, do you translate “Lamb of God” into an Eskimo dialect 
whose speakers don’t have the slightest idea of what sheep are? Or 
“anchor” into a language spoken by nomadic dwellers in the midst 
of the Sahara Desert? 

Quite often a term can be translated word for word, but the re¬ 
sulting translation carries completely disparate meanings in both 
languages because of cultural differences. A missionary in Africa 


18 


H. G. ROSENBLUH 


ran into this problem with the sentence, “Behold, I stand at the door 
and knock” (1 John, 3:20). In that region only a thief knocks on doors; 
if anyone responds, the intruder runs away. Visitors with honest in¬ 
tent shout the name of the person in the hut. The translator solved 
the problem by rendering the verse, “Behold, I stand at the door and 
call.” [3] 

Bible translation points up some of the problems that exist for all 
translators, and recent translations of the Scriptures into English 
show an acute awareness of the specific problem that not every text 
in one language can be readily, idiomatically, and unequivocally 
translated into another tongue. Thus, while the King James, Douai, 
and 1917 Jewish Publication Society versions of the Bible are sin¬ 
gularly free of—or, at least, mighty sparing with—notes, more re¬ 
cent translations—be they Protestant, Catholic, Jewish, non- 
denominational, or interdenominational (such as the 38-volume 
Anchor Bible)—contain copious notes, explanations of doubtful 
readings, alternate versions, comments on uncertainty about certain 
obscure words, etc. It came as a shock to many people to discover 
that Moses and Jeremiah and Jesus and Paul didn’t speak the 
majestic-sounding English prose of the King James version as their 
native tongue, but it comes as an even greater shock to find out that 
the Bible they grew up with may have contained some questionable 
translations. 

This really shouldn’t be too surprising, though. Besides the cul¬ 
tural differences existing between the original writers of the Bible 
and us, there is also the time difference, and modern Bible trans¬ 
lators are quite correct in admitting their inability to capture the 
precise shade of meaning in an old or obscure Hebrew or Greek word. 
Their willingness to admit their limitations should serve as a shining 
example to all translators to be honest with the people who will have 
to read their translation. If a word is capable of variant translations, 
this fact should be conveyed to the reader, provided that there is no 
way of clearing up the ambiguity by research. 

Whoever it was who decided to translate mokusatsu by the one 
meaning (even though that is the first definition in the dictionary) 
and didn’t add a note that the word might also mean nothing stronger 
than “to withhold comment” did a horrible disservice to the people 
who read his translation, people who knew no Japanese, people who 
would probably never see the original Japanese text and who would 
never know that there was an ambiguous word used. As a matter 
of principle, that unknown translator should have pointed out that 
word has two meanings, thereby enabling others to decide -on a 
suitable course of action. 
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But that choice was lacking; the harsh decision was made on 
the basis of the one translation available; and the consequence 
was disastrous. To be sure, the case of mokusatsu is an extreme 
example, but there are many, many other words in scores of other 
languages that have equally discrete nuances of meaning. Choosing 
the wrong meaning or failing to inform a decision-maker that there 
are such nuances could well have unexpected repercussions. Sup¬ 
pose that someone quoted a report about a Cuban farmer who placed 
a bomb on his farm near the fences of the U. S. Naval Base at 
Guantanamo. Can you imagine the reaction of the Joint Chiefs of 
Staff, especially if no one told them that the Spanish words trans¬ 
lated as “placed a bomb” might also mean “installed a pump”? 

Many translators hesitate to admit publicly that they can’t give 
a precise equivalent for an ambiguous text. They feel that saying 
that a word can have two or more equally possible meanings is a 
sign of their inadequacy. They fail to realize that if they pick the 
wrong one, and later results prove that they guessed wrong, their 
prestige will sink even lower than if they had insisted that the term 
was ambiguous. But the fault isn’t always the translator’s. Quite 
often non-linguist supervisors fail to understand that languages 
don’t always have a one-to-one correspondence in their vocabularies 
and they will insist that the translator give them just one meaning 
and “cut out that ‘ambiguous word’ nonsense!” 

Even if some high American officials ever knew that the Japanese 
Premier had used the word mokusatsu , they probably wouldn’t 
believe that it could be translated in either of two ways. More than 
likely some high-ranking officer (probably a bird colonel) in the 
Pentagon asked the top Japanese translator (probably a nisei Pfc) 
about mokusatsu and then refused to believe the story about two 
meanings. You can almost picture this colonel pounding his desk 
and red-facedlv bellowing, “Whaddyamean telling me that the 
word means either ‘I am maintaining silence’ or ‘I’m treating it 
with contempt’? Dammit, private, I can’t go tell the Joint Chiefs 
something like that! I gotta give ’em straight facts, not multiple- 
choice tests! Now you quit beatin’ around the bush and give me 
one answer that I can tell ’em!” 

But even if this scene never happened in the Pentagon, the fault 
for the mokusatsu incident is not entirely the translator’s. Believe 
it or not, the real culprit is no less a personage than Kantaro Suzuki, 
the Japanese Prime Minister himself! [4] After all, there would have 
been no translation problem if he had not used an ambiguous word 
for such an important statement. 

However, politicians are notorious for preferring words that are 
either meaningless or so full of meanings that no one can be sure of 
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just what they do mean. In all probability the word mokusatsu was 
well beloved by Japanese officials as their equivalent of “No com¬ 
ment!” simply because it does have such a broad spectrum of 
meanings. A politician could use it and not really be saying anything 
he couldn’t squirm out of later; but it also left him the opportunity 
to claim later that he had long been against the course of action 
under discussion. Terms like these are the ones that Theodore 
Roosevelt once referred to as “weasel words” because politicians 
have sucked all the meaning out of them the way that a weasel 
sucks out the contents of an egg. 

Politicians aren’t the only ones who use meaningless words. News¬ 
men are also guilty of this sin, and so are many people right here 
in our own agency. For instance, TV journalists and NS A writers 
both seem to like the word “indicate” and they use it over and over 
and over in a variety of senses ranging from “hint” to “show con¬ 
clusively,” and even as a synonym for “say.” During World War II 
the word “aircraft” became the “in” word, despite its vagueness. 
In the interim since it gained such popularity, I have seen the 
word used to translate foreign texts where the original was extremely 
specific in talking about one airplane, several airplanes, a bomber, 
crop-dusters, several helicopters, a trainer, a seaplane, or a DC-4, 
among others. But the translators or reporters uniformly used the 
word “aircraft” in titles and texts. I have seen items with the word 
“aircraft” used throughout, and it was necessary to read several 
paragraphs—in one case, three whole pages—to find a pronoun re¬ 
ferring to the “aircraft” so that I knew whether the word was singular 
or plural. And I’ve even seen a few documents in which, even after 
careful reading and rereading, it was absolutely impossible to de¬ 
termine whether one or many “aircraft” was or were meant. 

There are many other examples of ambiguous words that could 
be cited, but I think that these two exemplify the second lesson 
that can be learned from the mokusatsu incident: Try to avoid 
ambiguous words! 

Some years ago I recall hearing a statement known as “Murphy’s 
Law” which says that “If it can be misunderstood, it will be.” 
Mokusatsu supplies adequate proof of that statement. After all, if 
Kantaro Suzuki had said something specific like “I will have a 
statement after the cabinet meeting,” or “We have not reached 
any decision yet,” he could have avoided the problem of how to 
translate the ambiguous word mokusatsu and the two horrible con¬ 
sequences of its inauspicious translation: the atomic bombs and 
this essay. 
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NOTES 

[1] The story is told in greater detail in William Craig’s The Fall of Japan, published 
by Dial, 1967. 

[2] The World’s Most Tragic Translation, unsigned article in Quinto Lingo, January 
1968, p. 64. 

f3] The linguistic problems cited in the preceding paragraph were solved by equally 
ingenious methods. Many Eskimos catch a seal, cook it, and eat it at a communal 
meal before a big fishing expedition; so translators more or less equate this pro¬ 
pitiatory seal with the sacrificial lamb eaten by the Israelites before their depar¬ 
ture from Egypt (Exodus, 12: 3-11) and the expression “Seal of God” is used for 
Lamb of God.’ At the other extreme, Saharan natives do use pegs or picket 
stakes driven in the sand as a means of “anchoring” their camels and horses at 
night. So the American Bible Society rendered “steadfast anchor for the soul” 
(Hebrews, 6:19) as steadfast picket peg for the soul” when they translated the 
Bible into the language of one of the Saharan nomadic tribes. 

[4] Unfortunately, Mr. Suzuki will never know that I blamed him for all this trouble. 
He died 17 April 1948. 
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Machine Translation—Where Is It Today? 

BY CLYDE D. McDONALD 


INTRODUCTION 

A standard textbook on data processing will probably list language 
translation as one of the uses of modern computing machinery, but 
will not go into any degree of detail. This is probably the easy way 
out for the author, for I submit that computer translation is prob¬ 
ably the most controversial and misunderstood computer application 
yet encountered. During the past ten years, magnificent progress 
has been made in applying computers to business operations (e.g., 
banking), and in the five-year period preceding that, the computer 
had similarly revolutionized scientific computation. The number of 
computers in the United States has grown in exponential fashion, as 
has the investment in computer facilities. Breakthroughs in pro¬ 
gram compilation, computer speed, memory devices, and so on have 
cascaded in dizzying announcements. Only the sky seems the limit! 
But in computer translation, there is a disquieting note, as recorded 
in an editorial in the Washington Post, November 27, 1966: “If the 
computer has really failed in the relatively simple task of transferring 
thought content from one language to another, the result is to chal¬ 
lenge the whole broad assumption that we are entering upon a new 
‘age of the computer’.” What is this phenomenon being commented 
on here? Is the task really “relatively simple?” Indeed, it is not, 
for language translation falls into that annoying category of com¬ 
puter applications known as “ill-structured,” a term which describes 
problems for which no straightforward computing procedure can be 
obtained from analysis. Another important point illustrated by the 
editorial mentioned above is that the press, and indeed the public, 
has been generally unkind to computer translation. A viewpoint 
has developed that language is one area that is human, not to be 
invaded by those dumb-brute machines which have taken over our 
payrolls, gasoline bills, and supermarket contests. Characteristic of 
this human pleasure at computer failures is the class of joke which 
has grown up in company with computer language translation; 
(sample: the computer translates the foreign language equivalent of 
“hydraulic ram” into English as “water goat”).* Computer trans- 


*See also Arthur Salemme, “The Case of the Volatile Mouses,” NS A Tech¬ 
nical Journal , Vol. V, Nos. 3 & 4, Fall, 1960. 
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lation is not alone in the group of “ill-structured” problems for which 
attempts at computerization have been made, but it does represent 
a problem which was recognized early and on which large sums were 
expended over a period of years. Thus it may be a test case for 
computerization of the “ill-structured” problem, since it is furthest 
evolved. It is a case to be considered from all sides in search of new 
meanings since it may apply to other “ill-structured” problems. 
With this background information I began my search for published 
material on automatic language translation. An attempt will be 
made in succeeding paragraphs to expose some of the more conclusive 
material reviewed in the research for this paper. 

TRANSLATING SYSTEM PARAMETERS 

In concept, a mechanical translation system would consist of three 
parts: an input, a processor, and an output. Output devices which 
would be acceptable are available, and input devices as well; the 
latter would include the class of optional character recognition equip¬ 
ment now feasible. The processor is the heart of the problem. For 
current purposes, a large-scale, general-purpose computer appears to 
be adequate. Thus, it appears that there are no significant equip¬ 
ment problems relating to computer translation, but rather, that 
such problems as exist lie in the translation procedures used in the 
programs. Five different approaches, of increasing complexity, are 
cited and defined as follows: 

Item-for-Item Substitution —Linguistic items, whether they be parts of 
words, whole words, or groups of words, are processed independently 
and sequentially converted into corresponding items in another lan¬ 
guage. In this approach, for large dictionaries, clerical errors can be a 
problem, and the usual multiplicity of equivalents must be resolved 
in some manner. It has been found to be a useful machine-aid 
process for human translation. 

Morphological Processing —This method is a refinement of the item-for- 
item process; it takes into account the rules for the creation of derived 
forms of words, in addition to the dictionary forms. 

Syntactic Translation —This process takes into account the rules by 
which words are arranged into sentences. Automatic parsing, the anal¬ 
ysis of the source language sentences by functional segment (as, subject, 
predicate, etc.), was unusually difficult, but after many years of effort 
reasonably complete programs for automatic parsing have been devel¬ 
oped. However, the problem of constructing grammatical sentences 
in the object language has not been developed to the same degree. 
Transformational Translation —Transformation rules indicate how 
various syntactic constructions in a language are related to, or may be 
derived from, other constructions regarded as simpler or more basic. 
Translation by transformational processing would involve the use of 
transformational grammars both for the analysis of the source language 
and for the generation of the object language. Fairly good transfor¬ 
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mational grammars for English have been developed but are incomplete 
for other languages. 

Semantic Translation —Semantic translation would be based on mean¬ 
ing, for, after all, it is the meaning which must be conveyed. Research 
has been conducted in this area, but much remains to be done. 

The current state of automatic language translation is such that, for 
some pairs of languages (for example, Russian to English), the pro¬ 
cedure is conducted at a level somewhat below syntactic translation. 
This has been attained through the expenditure of significant amounts 
of time and money, and with resulting output which can at best be 
described as only fair. Human post-editing remains a requirement. 
(Mr. Richard See has provided information on a recent development 
in mechanical translation which I was not aware of at the time my 
paper was prepared. The Atomic Energy Commission is now using 
a mechanical translation program, developed at Georgetown Uni¬ 
versity, at their Oak Ridge, Tennessee, facility. Rapid service is 
being provided to about twenty users of Russian translations, without 
post-editing. This was reported in Science , Vol. 155, January 1967, 
p. 59 and also by Francois Kertesz, “Babel—or Translation by Man 
and Machine,” Oak Ridge Report ORNL-TM-1723.) 

AN EXISTING SYSTEM 

Most of the support for automatic language translation in the 
United States has been provided by the Federal government. During 
the ten-year period, 1956-1965, this support amounted to approxi¬ 
mately $20,000,000, about half of which was provided through the 
United States Air Force. The major Air Force effort has been the 
development of the AN/GSQ-16 (XW-2) Automatic Language Trans¬ 
lator for the USAF Foreign Technology Division. In a report pro¬ 
duced in 1965, the Arthur D. LITTLE Company has evaluated the 
Foreign Technology Division’s system, which is described as a ma¬ 
chine-aided system. The purpose of the LITTLE study was not to 
justify or to defend previous expenditures, but to consider the al¬ 
ternatives of continuing the system, improving it, or abandoning it. 
The analysis was conducted in terms of quality, cost, and timeliness. 
The evaluation concluded that the quality of output is equal to that 
of standard human translations. The system translation cost at the 
Foreign Technology Division was found to be about $36 per thousand 
Russian words, excluding overhead and equipment amortization; 
in-house human translation cost, excluding overhead, was about $40 
per thousand Russian words. Contract translation cost was about 
$33 per thousand Russian words, including contractor’s overhead. 
Post-editing and recomposition were the main cost components in 
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the process, accounting for over 70% of the total; input processing 
accounted for only 11%. With regard to timeliness, and excluding 
high-priority items, all forms of human translation were found to 
require 85-110 days for translation of a moderate-sized article. In 
contrast, machine translations of articles of similar length required 
no more than an average of 45-50 days. The study reached the 
conclusion, for the near-term, that the system is both technically and 
economically feasible, and that it should continue to be operated as 
a production facility devoted to producing high-quality service to 
users of translations. 

ECONOMICS OF TRANSLATION 

In April, 1964, with funds made available by the Department of 
Defense, the National Science Foundation, and the Central Intelli¬ 
gence Agency, the Automatic Language Processing Advisory Com¬ 
mittee of the National Academy of Sciences—National Research 
Council was organized under the chairmanship of Dr. John R. Pierce. 
The report of this committee, published in late 1966, may be regarded 
as a defense of human translation in the near future. “The Com¬ 
mittee is puzzled by a rationale for spending substantial sums of 
money on the mechanization of a small and already economically- 
depressed industry with a full-time and part-time labor force of less 
that 5,000.” The Committee’s report is not optimistic with regard 
to future demand for larger quantities of translations, even if they 
were inexpensive and of high quality. A major criticism of current 
machine translation in the report is that post-editing takes almost as 
much human time as human translation from the original. The 
Committee believes that present practices in translation can be im¬ 
proved; some machine aids to translation, such as automatic dic¬ 
tionary lookup, can be used. Some technical articles of little value 
can simply remain untranslated. United States scientists should 
easily be able to learn enough Russian—the most important foreign 
technical language—to read papers in their field. The Committee’s 
report, indeed, presents such a dreary outlook that “the chairman 
was asked to prepare a brief statement of the support needs for com¬ 
putational linguistics, as distinct from automatic language translation. 
This request was prompted by a fear that the Committee report, read 
in isolation, might result in termination of research support for com¬ 
putational linguistics as well as in the recommended reduction of 
support aimed at relatively short-term goals in translation.” In 
the requested statement, Dr. Pierce said that “among the important 
kinds of research that need to be done and should be supported are 
(1) basic developmental research in computer methods for handling 
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language, as tools to help the linguistic scientist discover and state his 
generalizations, and as tools to help check proposed generalizations 
against data; and (2) developmental research in methods to allow 
linguistic scientists to use computers to state in detail the complex 
kinds of theories (for example, grammars and theories of meaning) 
they produce, so that the theories can be checked in detail. 

“The most reasonable government source of support for research 
in computational linguistics is the National Science Foundation. 
How much support is needed? Some of the work must be done on a 
rather large scale, since small-scale experiments and work with minia¬ 
ture models of language have proved seriously deceptive in the past, 
and one can come to grips with real problems only above a certain 
scale of grammar size, dictionary size, and available corpus. 

“We estimate that work on a reasonably large scale can be sup¬ 
ported in one institution for $600 to $700 thousand a year. We be¬ 
lieve that work on this scale would be justified at four or five centers. 
Thus, an annual expenditure of $2.5 to $3 million seems reasonable 
for research. This figure is not intended to include support of work 
aimed at immediate practical applications of one sort or another.” 

LINGUISTS AND PROGRAMMING LANGUAGES 

As programming languages have become more sophisticated, the 
relation between them and linguistic science has become more recog¬ 
nizable. Before FORTRAN, there was little effect of linguistics on 
programming. In the period from FORTRAN to ALGOL, 1956- 
1960, the connection was almost exclusively in terminology; words 
and definitions, but not theory and technique, were borrowed from 
linguistics (for example, grammar, syntax). Intense borrowing has 
occurred during the period from ALGOL to the present from current 
mathematical linguistic theory, technique, and notation. It is now 
possible to teach syntax analysis of programming languages, and to 
construct programs which are general-purpose syntax analyzers in 
the sense that they parse any programming language of a given type. 
Programming has, in turn, had its effect on linguistics by providing 
linguistics with real models and by providing pressure on linguistic 
theory, directing it to problems particularly relevant to computation. 
Special programming languages (SNOBOL and COMIT) have been 
developed for solving certain linguistic problems. What of the future 
relation between programming and linguistics? According to the 
report of Dr. Pierce’s Committee, “These (programming) languages 
will become sufficiently complex so that a theory of their semantics or 
evaluation will be a sufficiently interesting model for the equivalent 
problems arising in natural language translation. 
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“Similarly, there will be a reverse flow from the development of 
semantic theories within natural linguistics into mathematical lin¬ 
guistic models, which, in turn, will influence programming.” 

FOREIGN EFFORTS 

In order to do work in machine translation, it is almost essential 
that computers be available to the researchers, and American efforts 
have been enhanced by the enormous expansion of computing facilities 
available in this country. The Russian effort is stronger in theory 
than in practice, being associated with the Soviet interest in cyber¬ 
netics. They appear to be short on computer time, and deficient in 
alphameric input/output equipment. There is also a French effort, 
and a small but practical English one. West Germany is oriented 
toward machine aids to translation and has made some progress. 
There is some study in computer translation being conducted in 
Japan, and, presumably, in Communist China. 

CONCLUSIONS 

From a period of growing interest, publication, and expenditure in 
the automatic language translation field which ended about 1961, we 
have seen that a period of uncertainty, controversy, and evaluation 
has followed. Tentatively, we may mark the end of this period with 
the publication of the Pierce Committee report in November 1966. 
Perhaps only now can the relationship between computers and lin¬ 
guistics begin to show its true depth. Clearly the problems of com¬ 
puters in translation are more difficult than many realized, but the 
rewards, I believe, are potentially greater than imagined by' most. 
Man (whatever his language, and whether he speaks or writes) must 
co-exist with the computer (whatever its language and regardless of 
the I/O devices). The pressures to communicate with the computer 
(or at least, to describe processes to it) in increasingly natural English 
will reinforce man’s efforts to communicate, using the computer as an 
intermediary, with other men who speak a different language. The 
possibilities of real-time computer interpretation can add an entirely 
different perspective to the economic outlook for computer transla¬ 
tion. Certainly language is an heuristic phenomenon for the human 
beings who created it and use it, and perhaps this is the approach 
that will ultimately be required. On the other hand, if it be granted 
that dictionaries and grammars have a stabilizing effect on language 
(especially if government sponsored), is it not possible that the 
future may bring with it increasing demands for rigidity in language? 
These forces of standardization can be seen at work in a related 
field-indexing and information retrieval. Faced with vexing di¬ 
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versity in terminology, one frequently has recourse to glossaries which 
define the “right” way to say a thing. In another field, the computer 
today is enforcing its style of writing on the computer programmer. 
Perhaps the future will bring a resurgence of the authoritarian gram¬ 
mar of the one-room school house. For those who might feel this 
approach to be too rigid for human acceptance, I wonder if they feel 
the same regarding the publication of a new glossary? Standardiza¬ 
tion of language may already be making inroads and may prove more 
tolerable in practice than in contemplation. 

Acknowledgments 

Mr. Richard See of the National Science Foundation was extremely 
helpful to me in providing guidance and the loan of documents; he 
provided the initial background information which directed and 
focused my reading. Dr. Sidney Jaffe (A7) and Mrs. Mary J. 
Fabik (P2), were helpful in providing documents for study. The 
five approaches to mechanical translation described under “Trans¬ 
lating System Parameters” were derived from Mr. See’s 1964 article 
in Science. 

BIBLIOGRAPHY 

[1J Paul W. Howerton, “The Parameters of an Operational Machine Transla¬ 
tion System,” Mechanical Translation , Vol. 6, November 1961. 

[2] Anthony G. Oettinger, Automatic Language Translation , Harvard University 

Press, 1960. 

[3] D. Yu Panov, Automatic Translation, (translated from the Russian by 

R. Kisch, edited by A. J. Mitchell), Pergamon Press, New York, 1960. 

[4] John R. Pierce, Languages & Machines, Committee Report, National Academy 

of Sciences, Washington, 1966. 

[5J Richard See, “Mechanical Translation and Related Language Research,” 
Science, Vol. 144, No. 3619, May 8, 1964. 

[6] Victor H. Yngve, “A Programming Language for Mechanical Translation,” 

Mechanical Translation, Vol. 5, No. 1, July 1958. 

[7] An Evaluation of Machine-aided Translation Activities of FTD, Arthur D. 

Little, Inc., Contract AF33(657)-13616, 1965. 

[8] Current Research & Development in Scientific Documentation, National 

Science Foundation, 1966. 


29 















T 


Employment of Military Linguists 

BY JACOB GURIN 


Some of the tricks which one’s memory plays make the past seem 
so much more exciting and colorful then it really was. Embellish¬ 
ments and exaggerations slip unnoticed into the body of wartime 
stories as a result of passing years and numberless repetitions, and 
they serve to define and lend dimension to what otherwise might b^ 
a series of flat and colorless occurrences in the life of a G.I. But these 
are the recollections we have, and perhaps they may serve as a source 
of instruction, warning or amusement to others. 

One of the unsung heroes of World War II was the linguist serving 
in Military or Naval Intelligence. My own experience was as a trans¬ 
lator and interpreter of the Japanese language, one of a collection of 
odd birds who decided somehow that they could serve their country 
more effectively that way than in any other capacity. 

In addition to all the other problems facing the U. S. at the out¬ 
break of the war, we had the Japanese language. It seemed reason¬ 
able to expect that the Services would contain types who knew Ger¬ 
man or Italian, but where would we find anyone who knew Japanese? 
True, there were a few reliable sources of this skill. A number of mis¬ 
sionaries had made Japan their life’s work but had returned to the 
U. S. as conditions became more difficult for Americans there. A few 
academic types specialized in Japanese history, language and liter¬ 
ature, and there was the handful of businessmen who had worked for 
years in Japan. But except for the Nisei (Americans whose parents 
were Japanese immigrants) the number of Americans who were con¬ 
versant with Japanese was small. 1 The missionaries were seldom of 
military age, and the businessmen tended to run to specialized vo¬ 
cabularies and fluency in restricted areas of conversation or reading. 
Their children, however, provided the Services an aristocracy char¬ 
acterized by such feats as chattering in Japanese with amazing 
fluency about all sorts of trivia. 

The Services had been training a small number of linguists all 
along as a matter of policy. It quickly became apparent, however, 


That eminent authority on things linguistic, Life Magazine, in September 1942 
estimated that fewer than 100 Caucasians in the U. S. were competent Japanese lin¬ 
guists. 
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that the number would have to be increased enormously if we would 
equip our side with the linguistic skills which the war required. The 
Navy opened a school at Boulder, Colorado, and later on another at 
Stillwater, Oklahoma. The Army’s schools were at Ann Arbor, Mich¬ 
igan and Minneapolis, Minnesota. 

Our teachers were Japanese, Nisei, and Korean. They displayed 
varying degrees of teaching skill, but were dedicated to their task. Few 
had experience as teachers, and fewer still had ever been called upon 
to teach the Japanese language. We all learned together. 

We had both men and women teachers, and we mimicked their 
pronunciation and choice of words. Later on, in Japan, I found out 
how lucky I was to have studied under male teachers. Some of those 
who were not so fortunate were greeted with giggles as they solemnly 
pronounced their Japanese. Just as in English you would not regard 
as standard masculine speech such phrases as “just a darling shade 
of gray,” or “simply dying to meet that precious girl,” so in Japanese 
a man is regarded with suspicion if he uses feminine words or into¬ 
nations. If the maid knocks on one’s hotel room door and he answers 
“haire 2 ,” she will come in and change the linen or whatever. But if 
a presumably male voice simpers “O-haitte kudasai 3 ,” she may not 
even enter the room! 

The students were a mixed bag. We had a former lion tamer from 
the Ringling Bros. Circus, a small-time stand-up comic, the former 
coach of the Tokyo Eagles baseball team, some businessmen who 
had dealt with Japanese-speaking communities on the west coast 
and had become interested in the language, BIJ’s 4 who had diffi¬ 
culty in sitting down to learn the written language, and the major¬ 
ity—college types who felt that the study of Japanese would be their 
cup of tea. 

Most of us started from scratch in the language. The objective of 
the course called for feats of memorization, covering huge amounts 
of vocabulary and several thousand ideographs, including a surpris¬ 
ing number which had absolutely no application outside of Japanese 
mythology. Other unlikely goals were: fluency in the spoken lang- 
guage; appreciating Japanese culture; understanding the workings 
of the Japanese mind. What success we did achieve was attributable 
more to the dedication of students and faculty than to any virtues in 
the educational policies and procedures. 


2 Come in. 

3 Do me the honor of entering my unworthy room. 

4 Born in Japan. Sons of missionaries and businessmen. 
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Our instructors repeatedly assured us that eventually we would 
come to understand what we were doing, that explanations were dif¬ 
ficult or impossible, and that by some mysterious alchemy we would 
in time become sufficiently adept to do the war effort some good. 
Oddly enough, they were at least partly right! 

The weakest part of our course, as we soon found out, was the at¬ 
tempt to teach us the spoken language. Theoretically we were pro¬ 
vided the opportunity to practice conversation by teaming up several 
times a week with a fellow student, a Nisei who had a reasonable 
command of ordinary spoken Japanese. But his obvious boredom and 
resentment quickly shattered our pathetic attempts to converse, and 
after a while the whole thing was given up as an unworkable idea. We 
did not fare any better in understanding the Japanese movies which 
were used as teaching aids. During the showing of one of these, a 
domestic drama of some sort, the students cheered mightily when, 
after about one hour of total lack of comprehension, they understood 
the maid when she knocked on the door and said “Excuse me!” 

After a year of training, we were declared ready to go into operation. 
Our program had been amazingly comprehensive, and even included 
elementary training in judo. Because of my large size, I was always 
used to illustrate the advantages of judo in permitting a small man 
to deal effectively with a large one. In damp weather I still ache as a 
result of some of those falls. We also practiced bowing and squatting 
on our heels for prolonged periods, since we understood that these 
were favorite pastimes in Japan. Having added to this list of accom¬ 
plishments mastery of the words and music of “Kimigayo,” the 
Japanese national anthem, a pocketful of Kanji cards 5 , and a box of 
dictionaries, we went out into the world convinced that what we 
really needed was another year of training! 

We linguists were a scarce resource. Our paltry number was splin¬ 
tered in the attempt to cope with the many and pressing needs for 
our services. Some of us went to the Central Pacific, some to the 
South Pacific, some to the Southwest Pacific, and some even found 
their way to Washington and the Navy and Army predecessors of 
NSA. 

It was my good fortune to be sent to to the Southwest Pacific where, 
shortly after my arrival, I was given my first document to translate. 
It was a diary taken from the breast pocket of the uniform of a dead 
Japanese soldier. The book measured about l l A by 2 l A inches, and 
although the entries had been made in a fine hand, they proved al¬ 
most undecipherable as far as I was concerned. Could it have been 

5 Small cards with Japanese ideographs on one side, and pronunciation and mean¬ 
ing on the other—indispensable to the student. 
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the bullet hole through the center of the book? Or the fact that the 
book and its owner had lain on the beach for several days half-in and 
half-out of the water, causing the ink to run? They had not taught us 
to cope with this at school! I saved face the next day, when I plowed 
triumphantly through a printed manual entitled “How to Dig a 
Hole.” 

I became a member of a unique organization entitled Allied Trans¬ 
lator and Interpreter Service, better known as ATIS. Its headquarters 
were hidden away in an obscure suburb of Brisbane, Australia, called 
Indoroopilly. The record of this remarkable outfit is quite impressive. 
In its few years of operation 14,000 prisoners were interrogated and 
two million documents were scanned, the more important ones being 
translated in whole or in part. ATIS was truly an international affair, 
encompassing the American Army, Navy, and Air Corps, the Austra¬ 
lian Army, Navy, and RAAF, the New Zealand Navy, the British 
Army, Navy, and RAF, and featuring several Chinese generals. 6 

One of our big problems was to convince combat troops of the value 
of prisoners and documents. We had to argue convincingly, for our 
troops were understandably loath to take chances with an enemy who 
appeared to be surrendering but who would fling a grenade at the very 
last moment. The record shows a terribly small percentage of pris¬ 
oners captured in relation to the number of dead and wounded. For 
example, at Guadalcanal 40,000 were killed and about 600 taken 
prisoner. At Buna there were 43,000 counted dead and only 625 pris¬ 
oners, and this ratio remained fairly constant until just before the 
conclusion of the Philippine Campaign. 

Once a prisoner was captured, he talked with entire freedom and 
without reserve. With only very few exceptions, mostly among naval 
prisoners, he displayed a complete lack of security training. The 
Japanese government boasted that no Japanese would permit him¬ 
self to be taken prisoner, and so provided no instruction in how to 
behave if captured. Since the prisoners felt that capture was an un¬ 
speakable disgrace, and that knowledge of their capture would bring 
ruin and probable torture to their own relatives, every one, imme¬ 
diately upon capture, gave false information as to name and organi¬ 
zation. Of course, one of our jobs was to trip up the prisoner in the 
fabric of his own lies, and then he invariably broke down and apolo¬ 
gized. From that time on we would have his cooperation, and like 
most prisoners he would talk about going to the U. S., since he could 
never go back to Japan because of his disgrace. 

Col. Signey Mashbir, who was in command of ATIS, tells the story 


fi I never did determine their precise function, but we were proud to have them to 
show off to visitors. 
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of a bearded prisoner who would not cooperate. Someone thought of 
threatening him with a shave, to be followed by a photograph of his 
true features and distribution of the picture in Allied and neutral 
countries as our first prisoner. When he learned of these plans he 
talked without reservation. 

We had one prisoner whom we interrogated for 18 months. He had 
been a patent attorney in private life, and had a remarkable mem¬ 
ory. Perhaps he did make up some of the details, but he supplied an 
incredible amount of valid information on a variety of technical sub¬ 
jects. He was a pet of ours, and we took him to the GI movies on the 
base and occasionally into town, where in his borrowed Yank uni¬ 
form he looked like one of our boys. It was a good thing that the 
townspeople didn’t know who he really was! 

Sometimes our interrogations were not particularly fruitful, in 
the sense of bringing victory to our forces. One member of our staff, 
a white-haired ex-missionary, worked over the same prisoner for a 
week, and we all grew curious about the report which would result 
from this exercise. When it appeared at last it was almost entirely a 
conversion progress report, with the observation that, in time, the 
prisoner “could make a good Christian.” 

On another occasion, an American Naval officer, an Annapolis 
man, was privileged to grill a captured Japanese Naval officer, one of 
the few we ever got. It turned out that they spent a considerable por¬ 
tion of the “interrogation” comparing promotion dates for their 
classes and the careers of their classmates. 

Sometimes we acquired prisoners in bizarre ways. After the Japa¬ 
nese had been defeated in New Guinea, they fled to the jungle, where 
food was scarce and the natives unfriendly. Occasionally a half- 
starved Japanese would be caught trying to look inconspicuous as he 
awaited his turn in a GI chowline, mess-kit in hand. Our outdoor 
movies, located at the edge of the jungle, also acted as a magnet, and 
you could be sure that we often had an unseen and silent audience, 
especially for any picture featuring Betty Grable, a special favorite 
among the Japanese military. For Betty they would creep up- to the 
very edge of the clearing which served as our theater. 

While the troops were requested to send in all enemy documents, 
flags, notebooks—anything, in effect, which contained Japanese wri¬ 
ting 7 —censorship of outgoing mail provided many valuable items. 
One notable example was on its way from a soldier to his girl in Iowa 
and bore the comment: “How do you like the Jap writing? It looks 
like hen tracks, don’t it?” These “hen tracks” were an Intelligence 

7 We promised faithfully to restore these to the finder after the war. 
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Report from Imperial Headquarters giving an exact record of Allied 
planes which had been shot down during the preceding month. 

The value of these documents and the possible penalty for failing 
to turn them in was illustrated by the tragic story of a young lieuten¬ 
ant killed by machine gun fire from an emplacement he was leading 
his patrol to discover. In his musette bag was a document he had 
“souvenired” and kept six days earlier instead of turning it over to us 
for translation. It showed the exact location of the very gun that 
killed him. 

Every scrap was treasured. Many were extremely messy, as could 
be expected, since we were not above working with items covered 
with blood, body fat, etc. ATIS developed a Document Restoration 
Section to restore charred and soiled documents for the benefit of the 
translator’s eyes. Along with chemical baths and ultraviolet light to 
make illegible documents readable, we also set up a simple process 
of sponging and ironing pages of all documents on which writing was 
decipherable. Employing five WACs who had been laundry workers, 
an electric ironer and some hand irons, we processed about 20,000 
pages a day. 

As is true in most other professions, translating documents can 
become a routine occupation with only an occasional event worthy 
of recall. One such occurrence was related by A. W. (Mike) Foley, 
who had been doing a translating job for the Marines. He had to go 
through a warehouse full of captured ordnance, reading the labels 
so that the equipment could be sorted and some of it used against its 
former owners. It was a hot, dirty job, and when it was over the grate¬ 
ful Marines offered to share their chilled beverages with Mike. Al¬ 
though everyone else seemed to enjoy it, he thought that the drink 
had a strange taste, and on examining the label he found that it read 
FU-RI-T-TO (Flit). 8 

Our work as linguists would have been very much more difficult if 
we had not had the talents of the Nisei to support us. Those Nisei 
who worked at ATIS had at least a fair knowledge of Japanese, and 
some were truly outstanding in both Japanese and English. Our best 
teams consisted of a first-class Japanese linguist, usually a Kibei 
(American Nisei who had received his schooling in Japan) and an¬ 
other man, usually Caucasian, whose English was good and Japanese 
at least fair. For highly technical subjects we would add subject- 
matter experts to the team. 

Life for a Nisei in the Army could be dangerous. Those serving with 
combat troops were under suspicion from their Caucasian comrades 
and at times were fired at from both sides. The Australians were at 

8 For you youngsters: everyone in 1945 knew that Flit was an insecticide. 
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first loath to accept their aid, but there simply weren’t enough Cau¬ 
casians for all the urgent language tasks. It was found to be necessary 
to issue to each Nisei serving with Aussies an identity card showing 
him to be a member of the Australian Expeditionary Forces. They 
were urged to wear “digger” hats and we tried to make sure that they 
were always accompanied by Aussie officers. The Nisei soon proved 
their worth, and all who served with Australian combat troops re¬ 
ceived letters of commendation. 

While most of the Nisei remained enlisted men, regardless of their 
obvious merits as linguists and intelligence specialists and their skill 
as soldiers, just before hostilities ceased permission arrived to com¬ 
mission one hundred of them. Many of these were my good friends. 
One in particular was George Kayano, who wrote the following about 
his experiences: 

“Yesterday I was commissioned 2nd Lt in the Army of the U. S. Sure makes 
one feel odd. For a while I felt lost. 

Today I received my first assignment. I am to be one of the few to witness the 
arrival of the Emissaries from Japan, to receive the surrender instructions 
from the Supreme Allied Commander of the Pacific. I was worried about the 
coming great event, also realizing that the whole world was waiting for this 
moment.” 

He then described in detail exactly what happened, the prepara¬ 
tions made to receive the Emissaries, the landing of the plane, the 
debarking of the Japanese mission. He recorded his conversation 
with the Colonel who rode in his car, who stated that Hiroshima was 
60 percent destroyed and that about 130,000 people had been killed 
or injured by the first atomic bomb. He concluded his letter with this 
paragraph: 

“That was one night in my life that I shall never forget. I believe that the 
United States is the only nation where a nobody like me can participate in an 
international function.” 

Now the war was over, but there was as much need for our services 
as before, perhaps more. Senior linguists were promptly distributed 
among general and flag officers to act as interpreters. I drew the CG 
of 8th Army, Gen. Eichelberger, but envied my friend Major Bowers, 
who outranked me and therefore became Gen. MacArthur’s inter¬ 
preter. He was the one who got the glamour duties, such as greeting 
the Emperor and conducting him to the General’s apartment on the 
occasion of their historic meeting. 9 

When we flew into Japan in the middle of August 1945 we didn’t 
know what to expect. Neither did the Japanese, I suppose, for while 
several of us stood there, practicing our Japanese, we overheard one 

9 After that, his principal linguistic duties consisted of taking Mrs. MacArthur 
shopping in the Japanese market. 
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old lady say to another: “You see, I told you that English wouldn’t be 
difficult. I can almost understand them now!” 

Headquarters for the occupation was set up initially in Yokohama. 
It was an exciting place to be and an exciting time; one was conscious 
of participating, at least peripherally, in the making of history. We 
were surrounded by famous names: Wainwright, MacArthur, Stas- 
sen. 10 One officer came up to me as I sat at the desk marked “G-2, 
8th Army,” and asked for an interpreter. He was hatless and wore 
two stars on the collar of his shirt; I didn’t recognize him as any of 
our generals. He explained that he wanted to survey the damage 
done by the atomic bomb in Hiroshima and Nagasaki, and then I 
realized that this “general” was Adm. Richard E. Byrd, the famous 
explorer! For a moment dreams of glory flashed through my mind, 
but then I thought of the look he would have on his face as I thumbed 
clumsily through my dictionary looking up the Japanese for “hemo¬ 
globin” or “skin eruptions.” I sighed and assigned him one of our 
better Nisei sergeants. 

A little while later, while at the same desk, I got an urgent call 
from CIC—Tojo had shot himself and appeared to be dying. A lin¬ 
guist was desperately needed to get his last word. I reflected briefly 
on the prospect: There’s Tojo, surrounded by reporters. He mumbles 
something and dies. “What did he say?” the reporters asked me. “I 
don’t know, I didn’t hear it clearly,” I respond. Heretofore my inter¬ 
preting experience had permitted me to say “Please speak more 
slowly. Please repeat that word. Say it another way. Please write it 
in Japanese.” I found another good Nisei sergeant to attend to Tojo, 
admitting shyly that I would rather do the job myself but that I had 
to stay with the desk. 

My boss, General Eichelberger, called me in a few days later and 
pointed to a samurai sword on his desk. It was an elaborate affair 
which had been in Tojo’s family for about 800 years, and the scab¬ 
bard was adorned with beautiful designs and valuable stones. The 
general looked at me rather sternly 11 and asked what the inscription 
said. It consisted of three ideographs, so elaborately engraved that I 
couldn’t possibly know what they were. And even if I could make 
them out, nothing in my background or reading would have ac¬ 
quainted me with a Sino—Japanese aphorism of the 12th Century. 
I gulped and asked permission to study the problem, which meant 


10 Yes, CDR Harold Stassen, USN. I don’t know why he was there, but he got a lot 
of attention. 

“Could it be that he was beginning to doubt the Army’s wisdom in assigning me to 
be his interpreter? 
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that I sent in a Nisei sergeant who was well versed in classical liter¬ 
ature. I half-hoped he would fumble it, but he had no trouble at all. 

I finally found a job which suited my talents. I was put in charge 
of a train headed for western Japan to pick up liberated Allied pris¬ 
oners of war. My first task after arriving in Osaka was to establish 
communications with Hqs back in Yokohama. I got on the telephone 
and found to my dismay that the operator couldn’t make out my 
schoolboy Japanese and that I didn’t have the faintest notion of what 
she was saying. I was rescued by an American Army lieutenant, a 
POW who had released himself. He asked whether he could help and 
to my surprise he did very well in spite of shocking grammatical 
errors. He explained that he had learned Japanese quickly because 
in camp one obeyed a barked order promptly or was encouraged by a 
rifle butt. On the way back to Yokohama with the trainload of liber¬ 
ated prisoners, I met a U. S. Navy lieutenant who had been trained 
as a Japanese linguist but who had carefully hidden this fact during 
his imprisonment for fear of reprisal. 

The time had come for the Japanese to disband their Army. This 
task fell to the Imperial General Staff, and I was assigned to super¬ 
vise the planning action. The Staff had prepared a list of military 
units, showing their locations and strengths, and had brought along 
an interpreter to read the list aloud in English. He was a Hawaiian 
who had gone to Japan as a youngster, had stayed on and became a 
lieutenant in the Japanese Army. His rendition of the list was hesi¬ 
tant and not entirely reliable, so I took over. As a linguistic challenge 
this was pathetic, but my performance caused a halt in the proceed¬ 
ings while the entire General Staff inquired of me how I had learned 
the language and shook their heads sadly when I told them about the 
service schools. They were astounded when I revealed to them that 
these schools had turned out hundreds of graduates who had mas¬ 
tered 12 their language. They appeared crushed at the thought that 
their language had offered them so little protection. 

Communication between the occupation forces and the people 
often broke down in spite of good-natured attempts on both sides to 
make do with sign language and pidgin English. On one occasion the 
MP’s were enforcing an occupation edict that the streetcars were not 
to be dangerously overcrowded. Several would-be passengers were 
chased off their precarious perches on one of the cars. One of this 
number persisted in going back to the car in spite of the MP’s ad¬ 
monitions, and he finally had to be hauled off to the local police sta- 

12 A little boastful inaccuracy should be expected from a member of the victorious 
occupying forces. 
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tion^ Luckily, one of our linguists happened to be at the police sta¬ 
tion 13 and he informed the MP’s that the man they had arrested was 
the ticket-taker. 

Obviously, if one relates his wartime exploits, he saves the best 
for last. The final anecdote should permit the storyteller to retire in 
quiet triumph, leaving his audience to glow for a few minutes in 
admiration and envy. I had been asked by the War Department to 
recruit civilian linguists in the U. S. to replace the military who were 
leaving the service. My wife and I traveled about the country for sev¬ 
eral months, looking up old friends and subjecting them to merciless 
examinations for pitiful C.A.F. ratings. In Los Angeles we attended 
a dinner given by the Japanese-American Citizens’ League, and my 
wife was the only one there who did not speak Japanese. I had taught 
her a few useful words and phrases, however, assuring her that with 
these she could sail through any emergency. At one point in the 
evening she turned to the young lady who was sitting next to her, and 
asked “Benjo-wa, doko desu-ka?” The response was an utterly blank 
look, and the question had to be repeated. Finally light dawned, and 
our young neighbor said, rather archly, “I know what you mean, but 
we never say it that way! 14 My wife informed me in a soft voice that 
it was Berlitz for her from that point on. 


13 For obvious reasons I won’t identify him. 

14 It was as though she had asked her hostess, at a posh party, where the john was! 
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The Voynich Manuscript 

“The Most Mysterious Manuscript in the World” 

BY BRIGADIER JOHN H. TILTMAN 


The following paper is a slightly expanded version of a paper which I 
delivered to the Baltimore Bibliophiles on March 4, 1967. I am fully 
aware of the inadequacy of my treatment of the subject. The paper is 
intended only as an introduction to the study of the manuscript for anyone 
approaching it for the first time. 

INTRODUCTION 

The Voynich Manuscript is a vellum book of over 200 pages. There 
is text on almost every page in an unknown script. There are also 
coloured drawings on all but about 20 pages. 

Plate 1 will give you an idea of what the script looks like. Plate 2 is 
an example of an illustrated page.* 

To the best of my knowledge there is no confirmed solution of the 
script or any part of it, and the authorship and general dating of the 
manuscript is totally unknown. With the exception of a small number 
of later additions (not in the unknown script), the character of the 
script and general behavior of the symbols appear to be constant 
throughout the book. Opinions differ as to whether the whole is by 
one hand. There appear to be no erasures or corrections, which sug¬ 
gests that the manuscript as we see it is likely to be a copy of an 
original which may be of an earlier date. In any case the writing of the 
manuscript and the painting of the illustrations must have been a 
major undertaking. The late Father Petersen made his own transcrip¬ 
tion of the manuscript without the illustrations, and it occupied him 
for, I believe, three or four years. 

Only a comparatively small part of this paper is original, i.e., has 
not appeared in print before. My purpose in writing it is to widen the 
circle of those who might be interested in the manuscript. I am a 
working man and have not been able to devote much time to its study 
and am fully aware of the many deficiencies in my knowledge. 

DESCRIPTION AND HISTORY 

In 1912 the manuscript was purchased by the late Mr. Wilfred M. 
Voynich (later a rare book dealer in New York) who ^discovered it in 

*A11 plates appear at the end of the article. 
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a chest in an ancient castle in Southern Europe.” It is now owned by 
Mr. H. P. Kraus, the New York antiquarian bookseller, who has 
revealed that it was found at Mondragone. This is a villa in Frascati 
near Rome, built by Cardinal Altemps about 1570. In 1582 Pope 
Gregory XIII issued from Mondragone the bull reforming the calendar. 
The villa apparently continued in the Altemps family as in 1620 a 
later member of the family bequeathed the Mondragone library to the 
Vatican library. In 1865 the villa became a Jesuit college which was 
finally closed in 1953. From 1912 to 1919, Voynich attempted to 
interest scholars in Europe and America in solving the script, while 
trying himself to determine the origin of the manuscript. 

The manuscript, when discovered, was accompanied by a letter, 
shown in Plate 3. With Mr. Kraus’s permission I quote from his 
catalogue number 100, entitled Thirty Five Manuscripts, the passages 
“History of the Manuscript” and “Conjectures concerning the early 
history of the Manuscript:” 

HISTORY OF THE MANUSCRIPT 

The manuscript enters recorded history on the 19th of August, 1666, when 
Joannes Marcus Marci of Cronland sent the codex from Prague to Athanasius 
Kircher, at Rome, with a signed autograph letter, which is found loosely laid into 
the manuscript. It reads as follows (transl. from the Latin): 

“Reverend and Distinguished Sir, Father in Christ: 

“This book, bequeathed to me by an intimate friend, I destined for you, my 
very dear Athanasius, as soon as it came into my possession, for I was convinced 
that it could be read by no one except yourself. 

“The former owner of this book asked your opinion by letter, copying and 
sending you a portion of the book from which he believed you would be able to 
read the remainder, but he at that time refused to send the book itself. To its 
deciphering he devoted unflagging toil, as is apparent from attempts of his which 
I send you herewith, and he relinquished hope only with his life. But his toil 
was in vain, for such Sphinxes as these obey no one but their master, Kircher. 
Accept now this token, such as it is and long overdue though it be, of my affec¬ 
tion for you, and burst through its bars, if there are any, with your wonted 
success. 

“Dr. Raphael, a tutor in the Bohemian language to Ferdinand III, then King 
of Bohemia, told me the said book belonged to the Emperor Rudolph and that he 
presented to the bearer who brought him the book 600 ducats. He believed the 
author was Roger Bacon, the Englishman. On this point I suspend judgement; 
it is your place to define for us what view we should take thereon, to whose 
favor and kindness I unreservedly commit myself and remain, 

At the command of your Reverence 
Joannes Marcus Marci 
Of Cronland 


Prague, 19th August, 1666.” 

The last numeral of the date has been altered by pen from “5” to “6,” obviously 
by Marcus himself. The emperor Rudolf II (1552-1612) was a scholar rather than 
a man of affairs; he neglected his duties as ruler of his realm in order to devote 
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himself to the study of alchemy, astrology, and astronomy, and he was the patron of 
Tycho Brahe, Kepler, John Dee, and a host of other scientists and pseudo-scientists. 
He resided in Prague throughout most of his reign, where he assembled a great 
collection of books and art objects. Rudolf, after acquiring the book, apparently 
loaned or gave it to Jacobus Horcicky de Tepenecz (died 1622), whose name, “Jacobi 
a Tepenece,” is written on the recto of the first leaf (erased, but easily visible under 
ultra-violet light). The form of the name shows that the book must have been in his 
hands after 1608, when the “de Tepenecz” was acquired by patent of nobility from 
the Emperor. Tepenecz was the director of Rudolfs alchemical laboratory and his 
botanical gardens. 

Dr. Raphael (1580-1644), a lawyer and minor poet, who supplied information con¬ 
cerning the book to Marcus, was connected with the Imperial court under Emperors 
Rudolf II, Ferdinand II, and Ferdinand III, and thus was in an excellent position to 
have obtained knowledge concerning the codex. He was attorney-general of Bohemia 
under Ferdinand III. 

Joannes Marcus (1595-1667), the writer of the letter, was Rector of the University 
of Prague, and a noted physician, mathematician, and orientalist. He was official 
physician to the Emperor Ferdinand III. In 1667 he was elected a corresponding 
member of the British Royal Society. He had studied under Kircher, at Rome. 

The connection of the cipher manuscript with the famous Athanasius Kircher, S.J. 
(1601-1680), is especially intriguing. He was one of the foremost scholars of the 
Jesuit order, keenly interested in problems of decipherment and the author of three 
works on an attempted solution of the Egyptian hieroglyphics, and of another work 
( Polygraphia , 1663), on codes and ciphers in general. In the XXth century the codex 
was acquired by Wilfried M. Voynich, a dealer in manuscripts, in 1912, who dis¬ 
covered it in a chest “in an ancient castle in Southern Europe.” 

CONJECTURES CONCERNING THE EARLY HISTORY 
OF THE MANUSCRIPT 

In his letter, Marcus says, “He (apparently the Emperor Rudolf) believed the 
author was Roger Bacon, the Englishman.” Bacon (12147-1294), famous scientist 
and philosopher of the Middle Ages, studied and taught at Oxford and Paris, and 
is believed to have died at Oxford. Professor Newbold, who considered this manu¬ 
script to be in the autograph handwriting of Roger Bacon, conjectured that his 
papers were acquired after his death by some English monastery; that the present 
one, on the dissolution of the monasteries after 1538, was acquired by John Dudley, 
Duke of Northumberland (15027-1553); that John Dee (1527-1608) acquired it from 
the Duke or some other member of the Dudley family; and that Dee, who lived in 
Prague for several years and was personally acquainted with Rudolf II, sold it to 
him. 

An interesting point is that John Dee, while in Bohemia from 1585 to 1588, 
possessed 

“a book . . . containing nothing but hieroglyphicks; which book his father 

bestowed much time upon, but I could not hear that he could make it out” 

(Sir Thomas Browne, quoting Dr. Arthur Dee, 1579-1651, the son of John Dee.). 
It is indeed very probable that the present volume was the one which Arthur Dee 
saw in his father’s hands; it would seem from the tenor of this reference, that the 
elder Dee was trying to extract some meaning from the book. It is quite possible 
that the Emperor, after purchasing it, had entrusted it to him for decipherment. 
The word “hieroglyphicks” would not, of course, refer at the time to Egyptian 
writing specifically, but to any secret alphabet such as that of the present codex. 
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Plate 4 is a photograph of the bibliographical description of the 
manuscript in the Kraus catalogue. 

In the manuscript there are 125 pages of botanical drawings, 26 of 
astronomical (or astrological drawings), 28 of so-called biological 
drawings, and 34 of pharmaceutical drawings, nearly all accompanied 
by text. At the end there are about 20 pages of script without il¬ 
lustrations. Plates 5 to 15 show examples of the appearance of the 
pages. 

Plate 5. —The plant depicted here has been identified as some sort 
of Bindweed (Convolvulus). 

Plate 6 . —This seems to me a fairly natural representation of cross¬ 
leaved Heath (Erica). 

Plate 7.—This has been identified as sunflower, giving rise to the 
hypothesis that the manuscript cannot be dated earlier than 1493, 
when Columbus introduced sunflowers into Europe. This identifica¬ 
tion has not been universally accepted. 

Plate 8. —This is an example of the many drawings which appear to 
be composite and cannot be identified as any one plant. I should 
perhaps apologise for the lack of definition in some of these illustra¬ 
tions. This one was prepared from my photostat which was repro¬ 
duced from Mr. Friedman's photostat, which was itself a copy of a 
photostat made many years ago by Father Petersen. 

Plate 9. —This is one of the astronomical illustrations. 

Plate 10. —This is one of the so-called zodiacal illustrations. There 
were presumably either one or two for each month, but the pages for 
two months seem to be missing. For each month there are 30 (or 
about 30) human figures round the edge of the circle. 

Plate 11 shows two pages tor April. One page has a black bull 
and 15 unclothed ladies, and the other, a white bull and 15 clothed 
ladies. All the zodiacal drawings carry the name of the month in the 
centre in a later hand and in readable script though the language has 
been disputed. 

Plate 12.— This is one example of the illustrations in the biological 
part of the book. I have not myself studied these pages, and ideas as 
to their meaning advanced by specialists in medieval and early Renais¬ 
sance history are completely outside my field. 

Plate 13. —This drawing is an example of one of many pages which 
seem to comprise a sort of pharmacopeia. 

Plate 14. —This is the first page. There is no title page, but this 
appears to be a table of contents, there being four paragraphs cor¬ 
responding to the four illustrated portions of the manuscript. 

Plate 15. —This is the final page and has on more than one occasion 
been assumed to be a key to a cipher used for producing the text as we 
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see it. You will see that only one short passage is in the unknown 
script used for the rest of the manuscript and that there appears to be 
some evidence of Latin words and at the end an unfinished Old English 
or Old German sentence. 

PROPOSED SOLUTIONS 

There have been three published solutions, none of them generally 
accepted. 

(1) William R. Newbold, professor of moral and intellectual philos¬ 
ophy at the University of Pennsylvania, began work on the manuscript 
in 1919 and in April 1921 announced that he had discovered the key to 
a cipher, that he was convinced that the author was Roger Bacon, and 
that he had decyphered portions of it. He said that his decypherment 
proved that Roger Bacon had possessed both a telescope and a micro¬ 
scope, although history places the invention of these several centuries 
after his death, and further that one of the drawings depicted the great 
Andromeda Spiral Nebula, of whose existence he, Newbold, had been 
previously unaware. His solution, which was accepted at the time by 
Voynich and a number of scholars, was eventually demolished, partic¬ 
ularly by Professor John M. Manly, Chairman of the English Depart¬ 
ment of the University of Chicago, in an article “Roger Bacon and the 
Voynich MS.,” Speculum , July 1931, and now has no supporters. His 
complex method of decypherment (I quote from an article by Mrs. 
Friedman in the Washington Post of 5 August 1962) “was reducible to 
nine steps. The first and last of these, without any consideration of 
the intermediate abstruse and confusing processes, are utterly devoid 
of precision and are incapable of yielding one and only one plain 
language text—a rigid requirement of any legitimate cipher method. 
His first step was to convert the individual strokes of each symbol 
into Greek shorthand, a process of which Newbold himself said: T 
frequently find it impossible to read the same text in exactly the 
same way.’ The reason for this, palaeographers say, is that what 
Newbold saw as separate strokes of a symbol are merely the results 
of the cracking, uneven spreading and fading of the ink, and the 
condition of the vellum because of the manuscript’s age.” Newbold 
died in 1926, and two years later his literary executor published a full- 
sized book from his voluminous notes and worksheets— The Cipher of 
Roger Bacon. 

Plate 16 is an illustration from his book showing his interpretation 
in the form of shorthand strokes of a letter of the script; the page 
illustrated is one of the astronomical drawings and the arrangement of 
the stars near the portion of script which the author has chosen for 
illustration suggests Aldebaran and the Hyades. 
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The controversy over Newbold’s solution left a legacy of ill-feeling 
which persisted for many years and which I found reflected in a letter 
which Charles Singer wrote to me in 1957. 

(2) In 1943 a Rochester lawyer named Feely published a book 
entitled Roger Bacon s Cipher; The Right Key Found. Feely was the 
author of some published items in the field of the Shakespeare author¬ 
ship controversy. His unmethodical method produced text in unac¬ 
ceptable mediaeval Latin, in unauthentic abbreviated forms. 

(3) In 1946, a research scientist, Dr. Leonell C. Strong, published 
a different interpretation of part of the manuscript. He claimed to 
have decyphered part of two pages, one of them the Sunflower page 
shown in Plate 7. He said that his decypherment revealed to him that 
the author was Anthony Askham and that the plain language text was 
in mediaeval English. Again his mediaeval English is not acceptable 
to scholars. He did not reveal the details of his key but his description 
of the cipher method makes no sense to cryptologists. However, in 
the process of preparing this paper I came across the following curious 
fact. Among the many editions of an illustrated English printed 
work known generally as Banckes’s Herbal, the first dated 1525, are 
two usually attributed to Anthony Askham, dated 1550 and 1555. 
Their title pages read:—“A little herbal of the properties of herbs 
newly amended and corrected, with certain additions at the end of the 
book, declaring what herbs hath influence of certain stars and constel¬ 
lations, whereby may be chosen the best and most lucky times and 
days of their ministration, according to the moon being in the signs of 
heaven, the which is daily appointed in the Almanac, made and 
gathered in the year of our Lord God 1550, the 12 day of February 
by Anthony Askham Physician.” The strange thing about this is 
that the astrological additions promised the reader, appear nowhere 
in either edition. Anthony Askham, a physician and clergyman, also 
published a number of almanacs. I managed to get a sight of one of 
these in a Library of Congress microfilm but found it only to be a 
Christian calendar on half a dozen pages. 

In 1950 I was introduced to the manuscript by my friend, Mr. 
William F. Friedman, who gave me photostats of a few of the pages 
to work on, chiefly the unillustrated pages at the end. From these 
pages I made a preliminary analysis of the text, disregarding all but 
the most commonly occurring symbols. For purposes of the present 
paper, I have found it necessary to substitute for the symbols con¬ 
ventional and quite arbitrary letters and figures, as shown at the top 
of Plate 17, very similar to the system previously devised by Mr. 
Friedman for discussion of the problem with a study group which he 
brought together for a short time at the end of World War II. You will 
see that I have limited the number of symbols to 17. The second 
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symbols which I have placed at positions 1, 2, 15, 16 and 17, I have 
treated, rightly or wrongly, as variant forms. As a result of my analysis 
I made a report to Mr. Friedman in 1951 of which the following is the 
gist (only slightly revised). 

“(a) Following are some notes on the common behaviour of some of 
the commonly occurring symbols. I would like to say that there is no 
statement of opinion below to which I cannot myself find plenty of 
contradiction. I am convinced that it is useless (as it is certainly 
discouraging) to take account at this stage of rare combinations of 
symbols. It is not even in every case possible to say what is a single 
symbol and what is not. For example, I am not completely satisfied 
that the commonly occurring A has not to be resolved into Cl or 
possibly 01. I have found no punctuation at all. 

(b) DZ and HZ appear to be infixes of D and H within T. The 
variant symbol represented by E appears most commonly at the end 
of a line, rarely elsewhere. 

(c) Paragraphs nearly always begin with D or H, most commonly 
in the second variant forms, which also occur frequently in words in 
the top lines of paragraphs where there is some extra space. 

(d) G occurs quite frequently as the initial symbol of a line followed 
immediately by a combination of symbols which seem to be happy 
without it in any part of a line away from the beginning. Otherwise 
it occurs chiefly before spaces very frequently preceded immediately 
by 8. Hence my belief that these two have some separative or con¬ 
junctive function. (I have to admit, however, that G also seems 
sometimes to take the place of O before D or H (though rarely, if ever, 
after 4); this is particularly noticeable in some of the captions to 
illustrations in the astronomical section of the manuscript—these most 
commonly begin OD or OH and it is here that we occasionally see GD 
or GH.) 

(e) I have tried, for convenience of handling, to divide words into 
what I call “roots” and “suffixes.” This arrangement is shown at the 
bottom of Plate 1 7. Regarding the second type of suffix, some of the 
combinations are so rare that I have been uncertain whether to take 
any account of them at all. Some are very common indeed. It seems 
to me that each of these combinations beginning A has its own char¬ 
acteristic frequency which it maintains in general throughout the man¬ 
uscript and independent of context (except in cases where two or more 
A groups are together in series, as referred to later). These A groups, 
e.g., AR or AIIL, frequently occur attached directly to “roots,” 
particularly OD, OH, 8 and 2. ODAIIL, 40DAIIL and 8AIIL rank 
high among the commonest words in the manuscript. 
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(f) There are however many examples of 2, 3, 4 or even 5 A groups 
strung together on end with or without spaces between them. When 
this occurs, there appears to be some selective preference. For example, 
AR is very frequently doubled, i.e., AR AR, whereas AIIL which is 
generally significantly commoner, is rarely found doubled. Perhaps 
the commonest succession of three of these groups is AR AR AE. 
AE very frequently follows AR, but AR hardly ever follows AE. 

(g) O, which has a very common and very definite function in 
“roots,” seems to occur frequently in “suffixes” in rather similar usage 
to A, but nearly always as OR and OE. OR AIIL is very common. 

(h) The behaviour of the A (and O) groups has suggested to me that 
they may in fact constitute some form of spelling. It might be, for 
instance, that the manuscript is intended to demonstrate some very 
primitive universal language and that the author was driven to spell 
out the ends of words in order to express the accidence of an inflected 
language. If all the possible A and O combinations can occur, then 
there are 24 possibilities. They may, however, be modified or qualified 
in some way by the prefixed symbols D, H, OD, OH, T, S, 8, 2, etc., 
and I have not so far found it possible to draw a line anywhere. This, 
coupled with ignorance of the basic language, if any, makes it difficult 
to make any sort of attempt at solution, even assuming that there is 
spelling. 

(i) E, usually preceded by A or 0, is very commonly followed by D, 
much less commonly by H, with or without a space between. In this 
connection, I have become more and more inclined to believe that a 
space, though not intended to deceive, must not necessarily be re¬ 
garded as a mark of division between two words or concepts. 

(j) Speaking generally, each symbol behaves as if it had its own 
place in an “order of precedence” within words; some symbols such as 
O and G seem to be able to occupy two functionally different places. 

(k) Some of the commoner words, e.g., ODCCG, 0DCC8G, 
40DCC8G, ODAIIL, ODAR, ODAE, 8AIIL, TC8G occur twice 
running, occasionally three times. 

(l) I am unable to avoid the conclusion that the occurrence of the 
symbol C up to 3 times in one form of “suffix” and the symbol I up to 3 
times in the other must have some systematic significance. 

(m) Peter Long has suggested to me that the A groups might repre¬ 
sent Roman numerals. Thus AIIL might be IIJ, and AR AR AE 
XXV, but this, if true, would only present one with a set of numbered 
categories which doesn’t solve the problem. In any case, though it 
accounts for the properties of the commoner combinations, it produces 
many impossible ones. 
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(n) The next three plates show pages where the symbols occur singly, 
apparently in series, and not in their normal functions. The column 
of symbols at the left in Plate 18 appears to show a repeating cycle 
of 6 or 7 symbols D (or H), 0, 2, G, C,?. In Plate 19 the succession of 
symbols in the circles must surely have some significance. One circle 
has the same series of 17 symbols repeated 4 times. Plate 20 also has 
an interesting column of symbols. In all three there are symbols which 
rarely, if ever, occur elsewhere. 

(o) My analysis, I believe, shows that the text cannot be the result 
of substituting single symbols for letters in the natural order. Lan¬ 
guages simply do not behave in this way. If the single words attached 
to stars in the astronomical drawings, for instance, are really, as they 
appear to be, captions expressing the names or qualities of those stars, 
there can hardly be any form of transposition system involved. And 
yet I am not aware of any long repetitions of more than 2 or 3 words 
in succession, as might be expected for instance in the text under the 
botanical drawings.” 

After reading my report, Mr. Friedman disclosed to me his belief 
that the basis of the script was a very primitive form of synthetic 
universal language such as was developed in the form of a philosophical 
classification of ideas by Bishop Wilkins in 1667 and Dalgarno a little 
later. It was clear that the productions of these two men were much 
too systematic, and anything of the kind would have been almost 
instantly recognisable. My analysis seemed to me to reveal a cum¬ 
bersome mixture of different kinds of substitution. When I was 
attempting to trace back the idea of universal language, I came upon a 
printed book entitled The Universal Character by Cave Beck, London 
1657 (also printed in French in the same year). Cave Beck was one of 
the original members of the British Royal Society and his system was 
certainly a cumbersome mixture. 

Plate 21 shows the title page. His system is based on a dictionary 
code of about 4,000 words to which are alloted the numbers from 
1-3999 in alphabetical order. 

Plate 22 is a page from his dictionary covering the words “That 
(conjunction)” to “till or untill.” When digital code-groups are un¬ 
accompanied by letters, they represent verbal infinitives, e.g., “to 
tickle”-3773. Groups are preceded by the letter R when a noun is 
indicated as in the case of most of the groups on this page; if the 
adjectival form is intended, the digital group is preceded by Q. In the 
case of words which the author regards as synonymous with words 
earlier in the alphabet, the earlier code-group is given, e.g., “to 
thinke”-1163, where “to cogitate” is found on an earlier page opposite 
1163. Prepositions are represented by the corresponding word in 
Latin, e.g., “through ”-per. Some very common words have trigraphic 
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equivalents beginning S and T to be found in a separate list of about 
175 such words; these are repeated in their alphabetical place in the 
main dictionary, e.g., “that”-SNA. There are also on this page 
“thou”-E, “thine”-HE, and “this”-HO; also “Thursday 5 1484, 
i.e., “number five day.” Plurals are designated by placing the letter 
S immediately after the digital code-group. There are also many other 
circumstances, particularly in the case of parts of verbs, which neces¬ 
sitate the prefixing of up to three letters before the digital code-group 
as the example taken from Cave Beck’s book shows ( Plate 23). As 
each word or concept treated as a word consists generally of a combina¬ 
tion of letters and digits, it is necessary to put a comma after it to 
delimit it. I have wondered whether G and 8G in the Voynich Manu¬ 
script might be accounted for as representing “comma” and “plural 
comma.” (I am aware that 8 occurs in other contexts in the manu¬ 
script, but so does S in Cave Beck’s system.) Curiously, in his example 
{Plate 23) his substitution for “days” is given as 14848, not 1484S, as 
one would have expected. 

Cave Beck starts his preface as follows: “This last century of years, 
much has been the discourse and expectation of learned men, con¬ 
cerning the finding out of an universal character.” If this is to 

be taken literally, it carries the idea of universal language back to 
about the middle of the 16th Century. I tried in 1957 to trace back 
the idea of universal character, but I had little time to devote to this 
research and the earliest evidence I was able to find is contained in the 
following extracts from two lives of Bishop Bedell, who died in 1642, 
one by Burnet, dated 1692, and the other by Clogy, dated 1682, re¬ 
ferring to a man named Johnston. 

“But the Bishop finding the man had a very mercurial wit, and a great 
capacity, he resolved to set him to work, that so he might not be wholely useless 
to the Church; and therefore he proposed to him the composing of an universal 
character, that might be equally well understood by all nations: and he showed 
him, that since there was already an universal mathematical character, received 
for Arithmetick, Geometry and Astronomy, the other was not impossible to be 
done. Johnston undertook it readily and the Bishop drew for him a scheme of 
the whole work, which be brought to such perfection that ... he put it under 
the Press, but the Rebellion prevented his finishing it.” 

“My Lord gave him a platform which he observed; all the difficulty was about 
the syncategoramata. He styled his books with spell. I heard that some part 
of it was printed; but the rebellion prevented finishing.” 

It would seem that these events cannot be dated later than 1641. 

It is, however, difficult to date the manuscript anywhere near as 
late as this. Charles Singer, in a letter to me, put the date at late 
16th Century. Professor Panoffsky and the keeper of the manuscripts 
at the Cambridge Library both independently insisted on a date within 
20 years of 1500 A.D., and the manuscript as we have it may be a 
copy of a much earlier document. 
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HERBALS 

At this point I propose to undertake a short digression on the sub¬ 
ject of manuscript and early printed herbals. This may be thought 
irrelevant and my treatment will certainly be very superficial, but, 
if the plain text of the Voynich manuscript belongs to the illustrations 
on the same pages, as we have a right to expect in the complete 
absence of evidence to the contrary, then much the greater part of that 
text is related to plants. However, I have to admit that to the best 
of my knowledge no one has been able to find any point of connection 
with any other mediaeval manuscript or early printed book. This is 
all the stranger because the range of writing and illustration on the 
subject of the plant world from the early middle ages right through 
into the 16th and even 17th centuries is very limited indeed. 

The first significant name is Krateuas, the Rhizotomist (rhizotomist 
meaning “root-digger”) who was body physician to Mithridates VI 
Eupator, the King of Pontus who was defeated by Pompey and took 
his own life in 63 B.C. Krateuas wrote a herbal in whjch plants were 
not described but were depicted in figures which were followed by brief 
discussions of the medical uses of plants. Some of this herbal has 
been preserved by Dioscorides who was a physician attached to the 
Roman army in Asia about the middle of the first century A.D., and 
who is much the most famous name as a herbal writer in classical times. 
He wrote in Greek the work usually known as De Materia Medica. 
One of the most beautiful and, I imagine, the most valuable manu¬ 
scripts in existence is the so-called Juliana Anicia codex at Vienna, 
written about 512 A.D. for presentation to Juliana Anicia, whose father 
was for a short time western Roman Emperor in 472 A.D. Two 
facsimiles of this manuscript have been published, the second in five 
parts, of which the first two are in the Garden Library at Dumbarton 
Oaks. Part of the manuscript is a text of Dioscorides, and the whole 
is magnificently illustrated in colour. I have reproduced here in Plates 
24 and 25 two pages of the manuscript. Charles Singer, in a most 
interesting article in the British Journal of Hellenic Studies for 1927, 
has restored about ten of the original drawings of Krateuas from it. 

Plate 26 shows Krateuas engaged in painting a Mandrake held by 
Epinoia, the genius of Intelligence, while Dioscorides beyond writes 
in a book. This representation of mandrake as a nude figure, male 
or female, with leaves sprouting from or replacing the head, persists 
in illustrated herbals into the era of early printed books and even into 
the 17th century, carrying with it a collection of superstitions, some of 
them to do with the precautions to be taken when digging it up. 

In 1957 I paid visits to a few specialists in early herbals in England. 
Among them I saw the late Dr. T. A. Sprague in Cheltenham and 
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showed him a few specimen photostats of herbal drawings from the 
Voynich manuscript, of which he had been previously unaware. As he 
looked at them he became more and more agitated and eventually 
said 

“Do you know what you are asking me to do? I have spent the last twenty 
years of my life trying to identify the plant drawings in the Juliana Anicia codex 
when the names of the plants are given in Greek, Latin and usually Arabic, and 
you are asking me to identify these awful pictures.” 

One of the earliest illustrated printed herbals was the Herbarium of 
Apuleius Platonicus (printed in Rome about 1481) with Latin text. 
(This is not the Apuleius who wrote The Golden Ass). Virtually 
nothing is known of him, but his work is believed to have been orig¬ 
inally written in Greek about 400 A.D. There are many extant manu¬ 
scripts of it, including a finely illustrated translation into Anglo-Saxon 
about 1050 A.D., now in the British Museum. 

But the three main herbal Incunabula, all originally printed in 
Mainz, are the Latin Herbarius —1484, the German Herbarius —1485, 
and Ortus Sanitatis —1491. This latter also appeared in translation 
as Jardin De Sante and Gart Der Gesundheit. There is a good facsimile 
of the German Herbarius. The woodcuts from all these are all rather 
stylized, obviously not drawn from nature. 

Plate 27 is a typical illustration from the Latin Herbarius. 

Plate 28. —I couldn’t resist introducing this as a lighter note. It 
shows the woodcut of Narcissus from Ortus Sanitatis beside Lear’s 
“Manypeeplia Upsidownia” from his Nonsense Botany. 

Plate 29 was prepared from a copy of Ortus Sanitatis at Dumbarton 
Oaks, showing the female Mandrake picture and the text that goes 
with it. 

A new era in the history of the herbal may be said to date from 1530 
when the first part of Herbarium Viuae Eicones was published in 
Strassburg. This is known as Brunfels’ herbal, and in it the plants 
are represented as they are. Plate 30 shows a woodcut from this book. 

Throughout the texts of all the early printed herbals (and presumably 
their forerunners in manuscript) runs the theme of the four elements 
and their natures—fire, hot and dry—air, hot and moist—earth, cold 
and dry—water, cold and moist. For instance, in the German 
Herbarius one finds on nearly every page a sentence such as “A finds 
this plant of the second degree of coldness and the third degree of 
dampness, but B considers it of the third degree of coldness.” 

In general, the illustrations in the early printed herbals are limited 
to two or three collections of stylized woodcuts copied over and over 
again in more and more degenerate form. The same superstitions 
persist. Probably the most fantastic is the story of the Barnacle Tree. 
According to one version, trees sprout from the wood of shipwrecks, 
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shells grow at the ends of the branches, and from the shells small 
geese gradually emerge and fly away. 

Plate 31 is taken from Gerard’s herbal—1597. Gerard claims to 
have seen this process in action with his own eyes on a small island in 
Lancashire. 

I make no apology for this short survey of herbals. I repeat that 
my purpose is to interest a wider circle, and the text and illustrations 
of herbals of the 13th, 14th, and 15th Centuries are a most interesting 
part of the background of this odd book. To the best of my knowledge 
no one has seen any book, certainly no illustrated book, of the period 
which covers the wide range suggested by the drawings in it. 

There are many aspects of the study of the manuscript which I have 
not covered in this paper. Little has been said about Roger Bacon, 
very little of John Dee, nothing at all of such figures as Ramon Lull. 
There is as yet no solid evidence that the manuscript is not by Roger 
Bacon or a copy of a work by him. 


BIBLIOGRAPHICAL NOTE 

[lj W. R. Newbold (edited by R. G. Kent), The Cipher of Roger Bacon , University of 
Pennsylvania, 1928. This book was published after Professor Newbold’s death 
from his notes and would probably not have been published at all had he lived 
longer since, after the first announcement of his decipherment of parts of the 
Voynich Manuscript, Newbold made virtually no progress at all. In spite of the 
fact that his solution is now quite unacceptable, the earlier part of this book 
should be read by anyone interested as it provides a more comprehensive back¬ 
ground than any other source. 

[2 J Agnes Arber, Herbals , Their Origin and Evolution, 1470-1670, second edition, 
Cambridge, 1953. This is the standard work on the history of herbals. 

13 J Robert T. Gunther, The Greek Herbal of Dioscorides, Hafner, 1959. The text of this 
book is an early English translation of De Materia Medica of Dioscorides, very 
fully illustrated with line drawings traced from the illustrations of the Juliana 
Anicia manuscript. 
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“THE MOST MYSTERIOUS MANUSCRIPT IN THE WORLD” 

THE ROGER BACON CIPHER MANUSCRIPT 

(BACON, ROGER ?.) Cipher manuscript on vellum. Text written in a secret script, 
apparently based on Roman minuscule characters, irregularly disposed on the pages. 
102 leaves (of 116; lacks 14 leaves), including 7 double-folio folding leaves; 3 triple 
folio folding leaves; and one quadruple folio folding leaf. With added signature marks 
(of the XVth or XVIth century), and foliation (of the XVTth or XVIIth century) 
Ml, 13-58, 65-73, 75-90, 93-96, 99-108, 111-116. With about 400 drawings of bo¬ 
tanical subjects, including many of full-page size; 33 drawings of astrological or 
astronomical subjects, plus about 350 single star-figures; and 42 (biological?) draw¬ 
ings, most of which include human figures. The drawings colored in several shades 
of green, brown, light yellow, blue, and dark red. Large 8vo (c. 230 X c. 160 mm.). Old 
limp vellum covers (now detached). From the libraries of John Dee (?), the Emperor 
Rudolph II (reigned 1576-1611); Jacobus Horcicky (Sinapius) de Tepenecz; Joannes 
Marcus Marci of Cronland (1666); Athanasius Kireher, S. J.; and Wilfrid M. Voynich. 
Accompanied by an Autograph Letter signed by Joannes Marcus, presenting the book 
to Athanasius Kireher. 

No place or date, (XVth century, or earlier?). 

An enigmatic mediaeval manuscript, which for over forty years has baffled the scholars and crypto¬ 
graphers who have attempted to wrest its secrets from it. It has been termed by Professor John 
M. Manly, who made a detailed study of it, “the most mysterious manuscript in the world.” 

Plate 4. 
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Plate 7. 
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Plate 8. 


61 













VOYNICH MANUSCRIPT 



Plate 9. 
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Plate 10. 
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Plate lib. 
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, Pa ( rt f ° f f ° U ° 68 recto ’ “ various enlargements: A is enlarged about two diameters and B 

thTLZ^Tth 618 '- 3 C l rtam am ° Unt ° f enlargement ' the characters lose definition, but 

the drawings of the microscopic characters by Newbold in B look probable. 

Courtesy of Wilfrid M. Voynich 
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THE UNIVERSAL | 

CHARACTER^ 

By which all the Nations in 
the World may underftand 
oneanothers Conceptions, t 

Reading out of one Common 
Writing the& own 
Mother Tongues. 

INVENTION 

OfGfiNERAL Uie, 

The Pradile whereof may be Attai¬ 
ned in two Hours fpacc, Obfcrving the 
Grammatical Directions. 


Which CHARACTER, is fo contri¬ 
ved y that it may be Spoken a$ well 
as Written. 


iy Cave ‘BecL M.A. 


T% ^ 

U U 4 


II 


r/itkjsf , and 

l6%7. 
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that confund fha 

to thatch 3757 
to than, no 1261 
a theater, tarn 480 
a theeFe p 1745 
then adverb of time fni 
tnm in companion fno 
thence * fnu 

Theology ti 6 o 6 
therefore fpa 

there " '—/’irfe 
a Thefis 3182 ® 

tbicke 0802“ —-* 13 * 
the thigh f ‘ 

a thiller ofa Cart r'3759 

a thimble ■ r'3766 

thinne q 312 
thine ■' he 

a thing raobp 
to thinK.e 1163 
to thirfc i 6/4' 
this • ho 

athiftlc r 3761 

thither ' •'* fpe 

a thong >r r 3762 

a thorne ■ ~r 3763 

a white thornc ' r *764 

& blacke thorne r ^763 
a thorne-backe fifh r 3766 
a thorpeor village r 3767 
through per 

thou ' c 

thraldornc x 626 
th-c'd r 3768 

threadbare or old 2cS 


to threaten “file ?A 1 ' 
to threfli* 1 • 
to thrive v : : 

thrift: or frugality t.2or6 
the throat rtif 4 ‘ 
a throne r^43b 
to throb 116“ ‘ 
to throng, v.CVi*il : 1399 
a chreftlebird' ! ! 3769 
to throw Or cafe * 927 
thrummes *' ^3770 

a thruih bird ' r 3771 
to,thhifc 1400 
a IBambe, +*,15)07 
to thumpe or knocke 670 
a thunderbolt, fare 624 
thunder r624 ' 

Thurfday +5,1484 
to/hwack, ant 508 
thyme herb r.3772 

to tickle ' > -’h 7 -' 3773 

the tide ‘ ^3774 

tidings *' r 3775 

to eye or binde 302 
tiffany f • r 377$ 

at-ger - • **r 3777 
a tick of a bed r 3778 

a tick worms 1 3779 

a tile r 378o 

a gutter tile or roofe tile 

r 2.197 

to till or allure 138 
to till the ground " 1422 
till or until! * fpa 

atm 
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An Example of writing and fpeaking 
the fifth Commandemmt . 


Honour thy Father and thy Mother 
tfrit^ leb 2314 p 2477 and pf 2477 

SftA ^ ieb coreonfo, pee tofofenfen, 5 c pif tofofenfea 

That thy days may be long 

fna, her 14848 mb 1716 
fna heronforafos, nub onfenonfic 

In the Land which the Lord ' 

in p 1699 vip 2529 

in poniicninnin viptohtonin 

pee byio 
pee behono 

’Note , for Euphonyfake or the better found, the.lqt* 
ter [Q in (V*3 may fometimes be left out'and 
[r 1 in 'at~\ may for the fame caufe be omitted or 
changed into theConfon 3 nt following as onforafo of 
onforaffo , wh ; ch I berty is ufually taken in all lan¬ 
guages as Commend for Conmend affemble for ai* 
iemble. 


L A U S DEO. 
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Fio. 3.—Tnacnto fbow the Juuaxa Akicia Codes or 512, Fo. 5 v. 

55'/*®** I*oW» a mandrake in her hand. Krateuas is painting it, while Dioskurides 
writes an account of it in a codex. In the original figure much of the paint has 
ecaled off and our restoration is conjectural in places. 


Plate 26. 
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BRIONIA 

Fig. 6. "Brionia” [Latin Herbarius (Amaldus de Villa Nova, 
Tractatus de virtutibns herbarum ), 14993 
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Fig. 22. Walwurtz mannlin", Symphytum officinale L., var. purpureum 
Pers., Comfrey QBrunfels, Herbarum vivae eicones , vol. i, 1530]] Reduced 


Plate 30. 
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Thcbrccdc of BamaUcs. 



Fig. 60. “The breede of Bamakles” QGerard, The Herball , 1597]] 

Plate 31. 
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Linguistic Canonical Forms: 
Tools for Analytic Progress 

BY CLIFTON H. BUTCHER 


THE LAY CONCEPT OF LANGUAGE 

The lay idea of language in a literate society is one of “words,” their 
meanings, and their spelling. The word is paramount—and publishing 
houses turn out paper back books for the popular market with titles 
such as Word Power Made Easy and 30 Days to a More Powerful 
Vocabulary . Success for a typist is partially predicated on typing cor¬ 
rectly, i.e., the correct spelling and hyphenation of words as well as 
their correct punctuation in sentences; therefore there are books such 
as Spelling for the Millions, Words Most Often Misspelled and Mispro¬ 
nounced, and 6 Minutes a Day to Perfect Spelling. And there are 
similar guides pertaining to style and word selection for writers and 
editors ( English Verbal Idioms, etc.). 

Word selection is the key to success; choose the exact word with the 
precise “meaning” to portray the “thought” you are trying to convey! 
And in the American society there must be an “authority” (an Emily 
Post as it were) for those meanings—and the authority is the dic¬ 
tionary. Any dictionary at hand is used to settle any argument. No 
attention is paid to the initial date of compilation, the method used 
for selectivity and ordering of meanings, or to whether the meanings 
represent actual current usage rather than egocentric or dialectocentric 
beliefs. The first meaning is taken as the prime meaning and the first 
listed pronunciation as “the” pronunciation! If a word has five 
meanings, one of them has to be entered first in a line of print—but 
what were the criteria for selection for the initial position? Was it 
first historical written attestation; meanings in alphabetical order; 
frequency attestation in literature; meanings arranged in an order from 
formal and learned “down” to “colloquial,” “vulgar,” and “slang”; 
meanings arranged in regional order with New England usages coming 
first; or random order? There is a dictionary for every price range and 
every taste—and the same holds true in selecting a wardrobe, a car or 
a house. 

For most “amateurs of language,” i.e., “lovers of language,” who often 
get degrees in language and literature and then, to earn a livelihood, 
become teachers of said subject matter, or turn to the business world 
or government and become travel agents, translators, researchers, 
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foreign language copy writers, interpreters, or possibly “analysts” 
using language as a tool, the above lay concepts are still paramount. 
They see language as being “thoughts” expressed in “complete sen¬ 
tences” which are spelled and punctuated “clearly”—with the key to 
correct expression being word selection according to “basic” meanings 
of words (with basic having something to do with etymology or formal 
writing and nothing to do with frequency of usage by the speakers of 
the language). 

With this underlying concept of language it is easy to see why the 

analyst believes refinement of his language tool consists in the 
preparation of more and better specialized glossaries of words and 
their meanings, and in frequency counts of words, letters, digraphs, 
and trigraphs. Such counts are made by “dragging” one or more 
dictionaries or are based on running text (literature, newspapers, 
scientific periodicals, textbooks, inter alia) All such counts vary 
appreciably. Counts based on dictionaries are the worst since they 
reflect each and every lexical entry (in non-inflected form) whether 
obsolete or current and bear no statistical relationship to content or 
style of any given text. Counts based on running text are biased with 
the degree of bias reflecting the topics and styles of the selected texts; 
such counts are invalid to varying degrees when applied to other types 
of traffic or subject matter. 

THE SCIENTIFIC CONCEPT OF LANGUAGE 

Language, according to most schools of linguistic science, is a 
structured interplay of a number of linguistic levels—with each level 
having its own basic linguistic building blocks which combine in var¬ 
ious sets of relationships. A building block is not necessarily restricted 
to one and only one level. The most basic building blocks are the 
PHONEME, the SYLLABLE, and the MORPHEME. Phonemes 
are acoustic bundles perceived through or by a structural mental 
matrix as syntactically arranged in meaningful units called morphemes. 
Syllables are the carrier wave of human speech; phonemic distinctions 
associated with the “meaningful” elements of speech are mapped on to 
the carrier wave units by articulatory modification of the vocal tract. 
Morphemes are phoneme sequences which bear a “meaning” (morpho¬ 
logical, lexical, syntactic, psychological, social, and/or “semantic”) 
pertinent to the linguistic conventions of some given language and 
society, these morphemes enter into syntactic arrangements known as 
roots, stems, affixes, “words,” idioms, phrases, sentences, paragraphs, 
topics, discourses, and even works of art (literature) and commerce 
(newspaper, books). 

Some languages have “words” in that discrete phonological or syn¬ 
tactic markers are habitually associated with a certain low level of 
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legality. English is one of the many languages in which “words” are 
not a structural unit—our so-called words are psychological correlates 
to several types of linguistic structural units; as psychological units 
these words are the legitimate field of study for a number of disciplines 
that are concerned with various aspects of semiotics (anthropology, 
sociology, psychology, psycholinguistics, sociolinguistics, philosophy, 
religion, politics, advertising, etc.). Investigators in some current 
theories of linguistics are investigating formally, rigidly and scien¬ 
tifically certain aspects, properties and fields of semantics, establishing 
testable hypotheses to account for data and subsets of data which bear 
on the interrelationships between the sound units of physics, the 
patterned activity of speech habits, and the extremely complex multi¬ 
dimensional array of “meanings.” 

Put in another way, language, like practically all human behavior, 
can be described as particle, wave and field in a complex hierarchical 
ordering of levels. Acoustic elements of speech sounds are arranged 
into particles called phonemes—and these particles in linear sequence 
in time are sound waves (syllables and sequences of syllables)—and 
certain lengths of these waves are accompanied in a given language by 
conventional meaning correlates which can be analyzed as distributed 
in various sets and fields. 


THE SYLLABLE 

Put in still another way, for a given language a finite number of 
phonemes occur in a quite restricted number of phoneme sets each 
usually characterized by a vocalic nucleus, and these latter sets are 
known as syllables. A syllable is a time-linear sequence of one or more 
phonemes, and there are restrictions as to the total length of such a 
sequence, as to which phonemes can occur in which specific locational 
slots and as to the possibilities of adjacent co-occurrence of some or 
all of these phonemes. Syllables are the pulse-trains on which are 
superimposed structural and psychological elements such as affixes, 
roots, stems, “words,” idioms and strings of same. 

In writing, a secondary derivative of speech, the above structural 
and “meaningful” units are represented by sequences of letters (or other 
marks on paper or stone such as ideographs and syllable-graphs) which 
reflect on an imperfect non-one-to-one basis some, but not all, of the 
segmental and suprasegmental phonemes of a language. In ordinary 
lay terminology letters are divided into two classes, consonants and 
vowels. These letters are the minimal units of a partially rational 
stylized spelling system for the artifacts of culture called words. 
Letters are to linguistic science as the molecules are to physics in this 
age of atomic and sub-atomic physics: a convenience. 
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In linguistic terminology those phonemes and phoneme sequences 
which occur as syllabic nuclei are called vocalic or vowel phonemes, 
and those which occur at the onset and offglide margins of the syllable 
are consonant phonemes. For the linguist the minimal carrier wave 
communication unit is the syllable: a vocalic nucleus is obligatory and 
there are optional consonant phonemes or sequences at the linear 
margins of the syllable. In reality the phonemes are internalized in 
a mental matrix and are manifested in the world of physics by acoustic 
correlates actuated by articulatory movements modifying the shape of 
the vocal tract as various sound emitters or resonance chambers are 
switched on or off. 


CANONICAL FORMS 


Let V stand for any vocalic nucleus and C stand for any consonant 
phoneme or phoneme sequence. The generalized so-called canonical 
form of the syllable can now be symbolized as CVC; or, more truly, as 
(C)V(C) with the parenthesis setting off optional elements. Any 
structural unit can be represented by a symbolic canonical form and 
the linguists deal with a variety of such forms (as do mathematicians 
and logicians) at the higher abstract levels of linguistic theory as 
hypotheses are posited for testing and evaluation. 

The symbol C could theoretically be expanded to an infinite number 
of C s, but the maximal consonant phoneme clustering at the onset and 
offglide margins of the syllable is sharply restricted in all human 
languages. The canonical form of the syllable in English is CVC* the 
expanded form is (C)(C)(C)V(C)(C)(C)(C). This expanded form 
accounts for 20 patterns: 


V 

CV 

CCV 

CCCV 

VC 

VCC 

VCCC 

vcccc 


CVC 

ccvc 

cccvc 

cvcc 

cvccc 

cvcccc 


ccvcc 

ccvccc 

ccvcccc 

cccvcc 

cccvccc 

cccvcccc 


Some languages permit even longer consonant clusters. Languages 
vary as to their finite inventory of segmental phonemes, as to the 
restrictions on specific phonemes occurring in specific positional slots 
in the canonical forms, and, even more important, as to the restrictions 
on co-occurrence in adjacent slots. There are over 2,500 languages in 
the world; each has from 15 to about 70 segmental phonemes; each 
favors certain syllable patterns; probably no two languages have the 


90 


C. H. BUTCHER 


same set of phoneme co-occurrence restrictions: This is the linguistic 
reality underlying the intuitive and pragmatic practices of counting 
letters and letter sequences in languages used by people who have by 
cultural accident adopted an alphabet and become literate. 

“Words” are composed of one or more CVC sequences. But lan¬ 
guage is never simple. Syllables are “open” (no consonant phoneme 
at offglide position) or “closed” (with a consonant phoneme at offglide 
position). Then, too, there are the non-segmental or so-called 
suprasegmental phonemes (stress, pitch, tone, length, etc.) which have 
to be considered in the syntactic arrangements commonly called 
“words,” idioms, or possibly phrasal nouns (the mayor of New York’s 
hat), and utterances. E.g., the canonical form of the English mono¬ 
syllabic word uttered as an exclamatory “sentence” can be represented 
as CVC 41 (Fire! Help! Ouch! Ugh! Scram!). 

The composition of words of the various languages varies as to 
number of syllables and as to the permitted suprasegmental patterns. 
Analysis shows that although words vary from mainly monosyllabic or 
disyllabic (as in Chinese) to mainly polysyllabic (7 or more syllables 
as in Hungarian or Eskimo) for a given language, still the syllable 
patterns fall into three major classes: utterance or word initial, medial, 
and final. Each syllable class in a given language is subject to specific 
restrictions. A five-syllable word could theoretically have the canoni¬ 
cal form CVC-CVC-CVC-CVC-CVC: with maximal consonant 
phoneme clustering of 3 elements the expanded canonical form of the 
word would be (minus parentheses for options): 

CCCVCCC-CCCVCCC-CCCVCCC-CCCVCCC-CCCVCCC. 

Now let’s take some language X which permits maximal consonant 
clustering only at word initial and word final positions and no clustering 
in the margins of medial syllables; the shape of the word would now be: 
(C)(C)(C)V(C)-(C)V(C)-(C)V(C)-(C)V(C)-(C)V(C)(C)(C). Now let 
us assume each word bears one and only one lexical stress. The 
canonical forms of the 5 syllable word in language X are: CVC-CVC- 
CVC-CVC-CVC, CVC-CVC-CVC-CVC-CVC, CVC-CVC-CVC-CVC- 
CVC, CVC-CVC-CVC-CVC-CVC, and CVC-CVC-CVC-CVC-CVC. 
And each of these can be expanded to the option formulations and 
these can then in turn be “exploded” vertically to become formulas 
covering the phoneme selection restrictions for each position and co¬ 
occurrence possibility. 

Looking again at the 5 different patterns for the 5-syllable canonical 
word it can be seen that there are 6 syllable types: initial stressed 
(I), initial unstressed (I), medial stressed (M), medial unstressed (M), 
final stressed (F), and final unstressed (F). Let x equal any syllable 
type occurring more than one time in a word. An alternate canonical 
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symbolization would then be: I-M-x-x-F, I-M-M-x-F, I-M-M-x-F, 
I-M-x-M-F, I-M-x-x-F. 

The linguist investigates all the syllable types; the canonical forms 
for syllables, morphemes, and words; sentence types; and the prefer¬ 
ences for certain patterns, as well as making finite inventories of the 
phonemes, affixes, and function words along with most frequent roots 
and the beginnings of a lexicon. Much of this information is com¬ 
pressed in formulaic expressions and presented as a linguistic sketch 
or linguistic description of a language. The basic scientific sketch and 
a selection of items from the lexicon can be used for a variety of 
applications: constructing writing systems, writing reference grammars 
or courses in the language, translating religious material, analysis of 
style, social judgments about language usage, investigating mental 
disorders, machine translation, cryptology, etc. 

This discussion so far has merely made explicit some of the principles 
that underlie what was commonly known as philology in the XIXth 
Century and is known today as historical reconstruction. Modern day 
philologists (like Dell Hymes at Yale) use electronic hardware to sort 
the linguistic data and set up most of the comparisons. Given the 
hindsight we now have, practically all of the comparisons made by 
linguists in establishing the Indo-European language family and the 
relationships between the languages of the family in the past 130 years 
could be done (after key punching) by modem equipment in a few 
minutes (and printed in a few hours). 

The problem is, of course, far more complicated as we come to terms 
with speech (as contrasted to derivatives of speech such as writing) 
and try to devise methods whereby machines can be programmed to 
“interpret” speech or perform sorting operations on running speech. 
The machine will have to handle (in addition to segmental phonemes, 
canonical forms, restriction rules, preference selections, etc.) the com¬ 
plexities of the suprasegmentals (stress, pitch, intensity, tone, etc., as 
well as psycholinguistic correlates such as “tone of voice,” voice¬ 
printing, and the like). This is all extremely complicated—but recall 
that any and all of the patterned habits of a given language are 
thoroughly internalized by “any 6 year old native speaker” of a given 
language. 

SOME CANONICAL FORMS IN SPECIFIC LANGUAGES 

Most of the discussion so far has been very general and rather 
ivory-towerish” (“theoretical” in the lay sense of the word), but can 
be clarified in some respects by examples from “real” languages. 

Fijian 1 is an example of a language with a very simple set of 
canonical forms for the word, namely: # Ci Vi C 2 V 2 (C 3 ); #CiVi 

l [l],p. 236. 
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C 2 V 2 c 3 v 3 (C 4 ); V; V x CV 2 ; # CV #; and # C, V x C 2 V 2 # [where 
# is word juncture, the C’s indicate any consonant (or none), the V’s 
. an y vowel, and parenthesized C is lacking when word juncture or 
initial consonant of a following word follows immediately]. 

An Indonesian language 2 is an example of a language with a fairly 
large number of morpheme canonical forms (90) and a fairly average 
number of segmental phonemes (6 V and 21 C phonemes). The distri¬ 
bution of the canonical forms is as follows: 

6 canonical forms for morphemes with 1 V 
24 canonical forms for morphemes with 2 V’s 
60 canonical forms for morphemes with 3 V’s 

If all C and V exponents could occur in every position there are mil¬ 
lions of theoretical possibilities for the construction of morphemes. 
A structural analysis shows there are negative rules excluding certain 
sequence combinations and positive rules favoring certain other 
combinations. Inspection of a comprehensive dictionary reveals that 
this language actually has about 20,000 morphemes (not millions). 
A statistical survey of the morpheme occurrences matched against 
the existing canonical forms shows that 3 canonical forms (CVCVC: 
6354 occurrences, CVCCVC: 3453, and CCVCVC: 1789) account for 
about 60% of the morphemes, and that the inclusion of 7 additional 
canonical forms (CVCV: 1307, VCVC: 1240, VCCVC: 819, CVCCV: 733, 
CCVCCVC: 713, CVC: 636, CVCCCVC: 479) raises the count to about 
90%. 

Any one canonical form has a theoretical maximum number of 
morphemes it can generate; this number is sharply reduced after ap¬ 
plication of the rules of exclusion and of frequency preference; and 
only a fairly limited number of the remaining possibilities are ac¬ 
tually “occupied” by morphemes. In this language only two of the 
canonical forms have high realization percentages (CVC: 76% and 
CV: 48%), but note that both patterns have rather limited maximum 
theoretical possibilities and that both exemplify “laws” of least ana¬ 
tomical effort. If there were no restrictions, the CVC pattern would 
generate 2,646 morphemes; with restrictions it could generate about 
650; but only 636 actually occur. In percentages, about 67 % of the 
possibilities are eliminated by rule application and only 76% of the 
remaining possibles are realized; i.e., about 24% of the theoretical 
possibilities. 

The most favored canonical form in this language is the CVCVC; 
morphemes of that pattern account for almost one third of all the 
morphemes. And yet, this pattern is exploited to a very low degree. 
The pattern could generate over 333,000 morphemes; some 107,000 


2 [2], p. 248. 
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after rule application. Thus the favored CVCVC pattern realizes only 
6 % (6,354 items) of its possible morphemes. 

The enormous number of non-utilized morphemes in any language 
shows that all languages are capable of enormous lexical expansion as 
the people using these languages adopt literacy and develop into com¬ 
plex industrialized societies. It also indicates why lexical items bor¬ 
rowed from other languages are re-shaped to fit into gaps of the exist¬ 
ing patterns rather than borrowed in toto with alien patterns and alien 
phonemes. This is the linguistic basis underlying the recognition of 
various types of “foreign accents”; given all of the linguistic data for 
all languages of the world the linguist (or a software program) could 
predict the pronounciation of “sputnik” or “baseball” in all languages 
as such terms are borrowed. 

To illustrate the restrictions on the selections of consonant pho¬ 
nemes which cluster at syllable margins let us examine the identical 
canonical form [CVC —> (C) (C) (C) V (C) (C) (C)J used for non-in- 
flected monosyllabic words in two related Germanic languages, Eng¬ 
lish and a cognate Scandinavian language. 3 There are numerous wavs 
of analyzing the segmental consonant phonemes of a language. The 
analysis shown in Fig. 1 is convenient for the purposes of this paper. 



PHONEME CLASSES 

Resonants 

Fricatives 

Stops and Affricates 

Cognate 

r lmni) 

fssh 

j v 

p t k c 

b d g 

English 

r 1 m n i) w j 

« fssh 

9 v z l 

p t k c 

b d g j 


Fig. 1.—Segmental Consonant Phonemes. 


Members of the resonant, fricative, and stop classes of phonemes 
combine in two-and-three-unit pre-vocalic and post-vocalic clusters. 
A mapping of the class combinations on the canonical form yields the 
mirror image box diagrams of Fig. 2. 

Now let us look at some of the general restrictions on the consonant 
phonemes. In both English and the Cognate language /h/ is always 
prevocalic and /x\! postvocalic; the Cognate’s /c/ is always prevocalic 
and the English /z/ was always postvocalic until quite recently (the 
name of Zsa Zsa Gabor has changed the phonemic distribution). The 

3 [3], Fig. 64, p. 132. 
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general restrictions on clusters are as follows. In the cognate language 
prevocalic /s, h, c/ never occur as members of clusters. In English pre¬ 
vocalic /d, v, z, z, c, j/ never occur as members of clusters. The phoneme 
positional distribution is shown in Fig. 3 (positionally unique pho¬ 
nemes are underlined). Note that the restrictions on clustering are all 
prevocalic. 



PRE VOCALIC 

POSTVOCALIC 


Clustering 

Non-Clustering 

Clustering 

Cognate 

r 1 m n 


r 1 m n i) 

f s 

j V 

s h 

f s § 

j V 

p t k 

b d g 

c 

p t k 

b d g 

English 

r 1 m n 


r 1 m i) 

6 f s s h 

d V z z 

0 f s s 

d v z z 

p t k 

b d g 

C J 

ptkbdg £ ] 


Fig. 3.—Phoneme Position Distribution. 
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The picture becomes far more complicated as we analyze the co¬ 
occurrence restrictions on the individual phonemes. The postvocalic 
co-occurrence restrictions are quite complicated in both languages; 
only the simpler prevocalic restrictions will be illustrated in this 
paper. Figure 4 shows the consonant phonemes which can precede 
/ j / in the cognate language (recall that / j / and / w / in English are 
analyzed as part of the vocalic cluster). 



Resonant 


Fricative 



--'s 





/' m \ 


/ ^ \ 



* ' 
i 


i . \ 

J j 



\ n ; 

/ 

■ ' • \ ! 

; / 



Fricative 


Stop 

/ / 

Resonant 


Cognate 



'n 

* P 

; j 

/ 


Vowel 




\ b / 

/ 






\ / 

/ 




Fricative / 


Stop 








• f ! 





\ 





'v_^ 





Fig. 4. 


Figure 5 shows the consonant phonemes that can precede the nasal 
and lateral resonants (/ n, m/ and /!/). 
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Resonant 


Fricative 





Stop 


Resonant 







■^A 1 ) 




Cognate 

( ,C\f 

V g V' 




V 



Fricative 

.f 

_ _ m j 





Xj/ 








Stop 











Resonant 


Fricative 





Stop 


Resonant 





/P \ 

1 b\ 


J, 1 ) 




English 



/"'A 

n ; 


V 



Fricative 

/ 






u y 
V V 


.* m ; 






Stop 








Fig. 5. 
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Figure 6 shows the consonant phonemes that can precede the re¬ 
sonant / r /, the voiceless stops / p,t,k /, and the labiodental fricatives 
(cognate’s / v /, English / f /). 
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A consolidation of Figs. 4, 5 and 6 would show all of the permitted 
prevocalic clusters (with the single exception that the cluster / ski-/ 
does not actually occur in English words) but would be very difficult 
to “read” in this format. Figures 7 and 8 illustrate the selection choices 
but not the co-occurrence restrictions for the non-inflected mono¬ 
syllabic word in the two languages; the phonemes in the small boxes 
are those that do not occur in clusters and/or those which preclude 
further clustering by occurring in that particular position. Recall that 
the choice at any position also includes the selection of no consonant 
phoneme and excludes the possibility of doubling a given phoneme. 



Fig. 7.—Cognate’s Non-inflected Monosyllabic Word. 
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Fig. 8.—English Non-inflected Monosyllabic Word. 

When the consonant phoneme selections are considered as numeri¬ 
cal possibilities, the information about the canonical forms can be 
posited as formulas or graphs. The formulas and graphs for clustering 
in non-inflected monosyllables in the two languages are shown in 
Fig. 9. 



Fig. 9.—Phoneme Possibilities in Clusters Within Monosyllabic Canonical Forms. 
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Note the typological similarity in prevocalic position; a comparison 
with the other Germanic languages may reveal that this patterning is 
a family resemblance. The differentiation in postvocalic position may 
be the specific distinguishing characteristic. 

When the choices are widened to include single consonants or no 
consonant the formulas and graphs become those of Fig. 10. 



Fig. 10.—Phoneme Possibilities Within Monosyllabic Canonical Forms. 

USES OF CANONICAL FORMS 

How can such arcane and esoteric linguistic studies be used? Of 
what, if any, practical value are they or could they be? Various 
intermediate steps are needed before any concept or generalization of 
pure or theoretical science can be utilized as a technique in applied 
science or technology, and then, of course, still more steps before 
application by wider groups of specialists, technicians and analysts. 
To span the gap the linguist is obliged to map the written letters of 
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a given alphabet against the phonemes and syllables of a given lan¬ 
guage, and to list all the relationships of fit between the components 
of subsets of linguistic theory and a conventional writing system with 
all of its inadequacies and ideosyncracies garnered through time. 

Remember, too, that a phoneme can be presented or represented 
in many ways (graph phoneme letters; oscillograms; mathematical 
notation as formants, formulas, numbers, or intersections; two-di¬ 
mensional chartings as acoustic “distinctive features”; three-dimen¬ 
sional graphics; kinesic symbols for articulatory movements; pala¬ 
tographs; x-ray motion pictures of articulatory movements; numbers 
to a base of 6 or some other base for key punching and data pro¬ 
cessing; and even brainwave patterns—and this is just for acoustic 
and articulatory phonology inasmuch as practically nothing is known 
to date about perceptual phonology). 

Given this “mapping” by the linguist and a set of instructions, the 
“words” of a dictionary or a corpus of traffic can be coded, punched 
and programmed for manipulation by computers. The machine can 
do all of the laborious (for humans) sorting tasks and provide the lexi- 
costatistical data based on the syllable and lexeme patterns which is 
then used by language and communications technicians in their tasks 
of information processing. When the statistics are added to the can¬ 
onical patterns, the material is ready for exploitation by Markov 
chaining and stochastic processes. The canonical forms of the sylla¬ 
bles and of words and other lexical units can be established for any 
language that is of research interest and all of the restrictions precisely 
stated. 

The techniques for the exploitation of lexicostatistical and pat¬ 
terned data will not be covered in this article. Linguistic analysis can 
have many applications; the remainder of this paper will cover tech¬ 
niques of linguistic analysis and proposals for application of same 
that will aid those language technicians who process what is known as 
Voice work’. 
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THE NEEDS OF THE “VOICE LINGUIST” 

A linguistic analysis of a language is extremely important to the 
“voice linguists” working as transcribers, gisters, and transcriber 
supervisors. What the experienced voice man believes to be his “ear” 
is actually many things, including aural acuity, ability to filter out 
noise and to disentangle two or more speech signals which at times 
override each other, an excellent command of the foreign intonational 
patterns and audible paralinguistic features of the target language, an 
extensive vocabulary and/or thorough familiarity with a more stereo¬ 
typed smaller inventory of words and expressions, work experience, and 
his “feel” or “intuition” as to what he hears when the tape quality is 
poor and the “words” include items that are not in his active or 
passive vocabulary. This “feel” or “intuition” which helps him recon¬ 
struct sounds and then guess at spelling so that he can then use a 
dictionary or specialized glossary reflects the degree to which he has 
internalized the systemic relationships of the phonemes, canonical 
forms of syllables and morphemes and words, intonational patterns, 
and the statistical probabilities of same. A countering influence or 
cause for error is based, among other things, on the dominant inter¬ 
nalized systemic patterning of his native speech normalized in the 
perceptual adjustments he is accustomed to making as he listens to 
speakers of other regional or social dialects (plus competing patterns 
from the X other languages in which he has acquired a speaking or 
comprehension skill). The techniques of applied contrastive linguistics 
can be used to isolate many of these variables and focus on the causes 
for indeterminacies. 

But for the phoneme and syllable analysis to be of value to the 
“voice linguist” still another analysis must be done by the scientific 
linguist and presented as a chart or some other type of linguistic aid. 
The phonemes are the bundles of acoustic signals which mean nothing 
but which serve to differentiate meaning—and an alphabet is an 
imprecise recognition of some of these differential clues. The voice 
man at the skill or non-cognitive level is not dealing with letters (or 
words or meanings) but with the selected features, used within the 
linguistic system of a given language, of pulses of sound waves. Now 
we’ll have to introduce one additional term of linguistic science jargon, 
that of the “allophone.” 

The phoneme is an early invention of mankind (indeed, for some 
theorists the defining characteristics of man are his oral “manipula¬ 
tions” of the phoneme and the applied uses of the opposable thumb)— 
although the term itself and the degrees of precision as to its usage in 
linguistic theory only goes back a few decades. But, the phoneme is 
an abstraction—a basic building block of language and speech, but 
still and all an abstraction used only by human beings with human 
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brains. Note how limited are the other so-called languages that are 
based not on an abstraction but on recognizable physical features (or 
derivatives of some grammatical subset of language such as numbers), 
e .g., the “language of flowers,” the “language of gems,” candy marking 
codes, labels for wedding anniversaries, beacons, flag systems, salutes, 
and even such things as apparel to mark the “in” group, or disfigura¬ 
tion (tattooing, scarification) or application of pigments (cosmetics) 
to the skin. Higher systems of communication are secondary deriva¬ 
tives of speech (talking drums, Canary Island whistle speech, writing 
systems) or tertiary derivatives (Morse alphabet, semaphore alphabet, 
sign language of the deaf, Braille, etc.). 

The manifestation in the real world of physics of the abstraction 
called the phoneme is the allophone. Human sounds are just sounds; 
human speech sounds are allophones representing the abstract 
phonemes of some specific human language. A phoneme can be con¬ 
sidered as a mental target which is aimed at by the neuromuscular 
speech mechanism but whose realizations are never precisely on target 
and never precisely the same for any repeated attempt. A convenient 
metaphor is that of a burst of buckshot on a paper target (Fig. 11). 

Let the entire buckshot burst represent the phoneme /p/ with the 
target center being the mental construct and the individual pellets 
being the actualized [p] allophones by one individual in various /p/ 
attempts or the various results of many individuals in similar attempts 
(note that slant lines are used to mark phonemes and square brackets 
to mark allophones in phonemic notation). The voiced counterpart of 
/p/ is /b/, and the nasal voiced counterpart is /m/; all three phonemes 
share a number of physical correlates and each has some distinctive 
characteristics which set it off from the other two or at least one of 
the set. Now let us fire the gun three times at the target, one time 
for each of the three phonemes, and observe the shot displacement 
(Fig. 12). 

See how trouble is arising. Certain acoustic phenomena can be inter¬ 
preted according to patterning, “meaning” and noise interference as 
an allophone of one, two, three (or possibly even more) phonemes. 
What is he selling? A “bear” or a “pear,” or is it a “mare”? Acous¬ 
tically there is very little difference in the sound of the stop consonants 
(actually none since stop phonemes are a lack of sound with the 
signals in reality being carried by part of the marginal components of 
adjacent vocalic sounds), so maybe he said “dare,” “tare” or “nair,” 
or was it “care” or maybe “hair” (or “tear or “hare,” or even “there” 
as pronounced in some dialects). The brain deals and traffics at the 
cognitive level in “meanings” and tries to make a perceptual “Gestalt” 
with a meaning for any unknown series of human speech sounds. To 
prove this, listen to any unknown language and you will “hear” an 
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Fig. 11.—Manifestations of a Phoneme. 


occasional real English word in this stream of syllabic nonsense. The 
mind will try its best to pattern these strange sounds into something 
understandable; like the many Frenchmen in World War I who were 
subjected to the American doughboy joke of being greeted in fluent 
Americanese with a “Barndoor manure!” and who wondered why those 
Americans laughed so hard as he politely replied with a “Bon jour, 
Monsieur.” 

To add to the complication of the phoneme concept, the sound 
features and buckshot pattern of a given phoneme vary according to 
position within a linguistic unit and according to the adjacent and 
even nearby sounds; e.g., the “p” in “pit” and “spit” are appreciably 
different allophones of the phoneme /p/ in English (but we native 
speakers never notice this difference until it is called to our attention). 
The Hindi speakers hear these two allophones of /p/ as allophones of 
two separate Hindi phonemes; when the Hindi speaker speaks English 
he will vary in his selection of phonemes to cover English words and 
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Fig. 12.—Manifestation of Three Similar Phonemes. 

his attempts at “pit” and “spit” will sound to English native speakers 
to be “bit’ and “sphit” (with variations too for the /s/, /i/ and /t/). 
Therefore one linguistic aid for the voice linguist would be letter 
charts” representing the phonemes and distributional allophones of the 
target language; a variation of this aid or a separate one would be a 
phoneme by phoneme comparison of the sounds of English and the 
target language. Still other aids would be tables of statistical prob¬ 
abilities of occurrence of certain sounds and sequences of sounds. 
Other charts could cover the most frequent misinterpretations for 
certain sounds (“f ’ and “s” give difficulty in practically all languages). 

The tools of the voice linguist to date are his dictionaries, glossaries, 
reverse sort dictionaries, rhyming dictionaries, card files, and electronic 
retrieval systems (information or lexical storage). In addition to the 
type of aids outlined in the preceding paragraph there is a whole 
battery of new aids and tools which linguists and programmers acting 
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in concert can provide for the “voice” specialists (and for translators, 
supervisors, and cryptologists). Once the phonemes, syllables and 
canonical forms are matched against a given dictionary or corpus of 
material in a machine program, any kind of a sort can be made. 

Dictionaries are extremely inefficient tools for voice work (in terms 
of man-hours or man-years when all dictionary thumbers are included); 
if the “word” is not recognized, the voice man requires real assistance. 
So why not provide a dictionary (using conventional spelling) in a more 
usable form. The linguist sees the value of word sorts according to 
canonical shapes and lexeme patterns rather than alphabetical listings, 
i.e., listings of all monosyllabic words, disyllabic words with initial 
stress, disyllabic words with final stress, etc., (no meanings are required, 
that is the function of dictionaries and glossaries). Each lexeme pat¬ 
tern can have several listings: an alphabetic one, vowel rhyme, con¬ 
sonant class criteria (stops, fricatives, nasals; voiced, voiceless; etc.) 
and others, thus putting in close proximity all words which rhyme with 
a linguistic ordering based on probabilities of mis-hearing. Other lists 
would be by syntactic classes (similar to the so-called “parts of 
speech”) or by words which participate in more than one such class; 
some words function as both noun and verb, others as noun and 
adjective, others as adjective and adverb, others as verb and adjective, 
still others as noun, verb and adjective, etc. Another aid could be a 
list of the “function” words of a language, the meanings of these 
funtions in various sentence types, and suggested translations of these 
function words in those particular sentence types. Still another aid 
would be the roots and stems of a given language with a special 
section or another aid for the affixes and their permutations on the 
stems. An aid combining syntactic and phonological criteria for words 
could be devised (Fig. 13). Such aids or lists could be revamped to 
reflect usage statistics for various types of content or traffic. 


SPEED PHONEMICS 

The discussion to this point has been concerned with linguistic units 
as static entities rather than as active malleable units in the stream of 
speech. What changes do the allophonic representations of the 
phonemes undergo as speech speed is increased? Do some phonemes 
just disappear? And if so, which ones and under what circumstances? 
A field which has barely been scratched is the cross-disciplinary field 
of speed phonemics (which requires work by linguists, acoustic 
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Fig. 13.—Word List by Function and Rhyme Scheme. 

phoneticians, perceptual psycholinguist researchers, physicists, mathe¬ 
maticians, and hardware technicians); and little will be done until 
funds are made available and the need for such research realized. 

Phonemes are isolated early in a linguistic investigation; the linguist 
then devotes his attention to the morphology, syntax and lexicon of a 
given language. Phonemes are elicited by comparing (minimal) pairs 
of lexical items or very short phrases spoken distinctly in isolation; 
these phonemes are normally presented as graphic symbols on paper 
and little further investigation is carried out unless the language con¬ 
cerned is characterized by “interesting” or complicated morpho- 
phonemic patterns. An example in English of such interesting patterns 
is shown in (the pronunciation and spelling of) the following word 
pairs: 

knife: knives child: children ride: rode 

foot: feet wide: width sheep: sheep 

Speech and language permit communication despite considerable 
suppression of acoustic signaling data through the devices of syntactic, 


Noun 

Verb 

Adjective 


fair (1) 


Noun 

Verb 


pair 
bear (1) 
bear (2) 
tear 
tare (3) 
dare 
care 

chair (1) 
chare 


hair 


Verb 

Adjective 


bare 


Noun 


pear 


tare (1) 
tare (2) 


gair (1) 
chair (2) 


fair (2) 
hare (1) 


Verb 


pare 


hare (2) 
hare (3) 


share 

[etc. through all initial single phonemes] 


spare (1) 


spare (2) 
stare 


stair 


[etc. through all initial consonant clusters] 
[Then move on to next vowel nucleus] 


peer (1) 


beer (1) 
beer (2) 
bier 


peer (2) 


Adjective 


gair (2) 
there (1) 


Adverb 


there (2) 
fair (3) 


Pronoun 


there (3) 
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semantic and phonetic redundancy; thus we can understand young 
children, toothless oldsters, gum chewers, pipe biters, and even shouters 
in a boiler factory. This is true for all languages. In literate societies, 
suppression of phonetic data by the speaker (as contrasted to environ¬ 
mental noise) is judged to be “sloppy” speech; the prescriptive 
standard or imagined “norm” is the written form of the words and 
words are supposed to be enunciated as they are written (whatever 
that means). The normative prescriptive teacher of English tries to 
make the students make a differentiation in their pronunciation of 
“sentences” like the following (“phonemic” notation that of a diction¬ 
ary and not truly phonemic): 

Picture right here! ’pikcha(r) 

Pitcher right here! ’picha (r) 

Pitch her right here! pich (h)er 

Most people can make a differentiation in slow studied speech; but in 
regular or rapid speech there is no difference in the above three ex¬ 
amples [picha (r)]. The semantic differentiation is supplied by the 
situational context, not by the phonemes and not by the syntax; the 
“words” are an after-the-fact analysis supplied by the hearer. 

The common languages taught in America have been French and 
Spanish; and Spanish is “judged” to be easier to learn. But is it? 
The judgement is really concerned with phonology and the spelling 
conventions. “Spanish is phonetic” (or “Russian is phonetic”); it 
sounds the way it is spelled! In reality, Spanish is a syllable-timed 
language and such languages are characterized by far less phonetic data 
suppression than are stress-timed languages such as English; thus 
Spanish is “easy” for English speakers. Maybe the Martians “speak” 
with their eyes; thus their language is not phonetic; but we gaeans (or 
terrans), unless mute, all speak phonetically. 

French, on the other hand, is a breathgroup final-stress language 
characterized by considerable phonetic suppression at the beginning 
of breath groups. This factor, plus a fairly “irregular” spelling con¬ 
vention, underlies the judgement that French is “hard.” Teachers of 
French practically all start out with phonetic rather than spelled 
representations of the words and phrases and later use these symbols 
to show the regular changes that occur in actual speech (permutations 
and replacements). Some examples are: 

j e ne sais pas [X a n a sepa] —> [z o nsepa ] 
jen’iraipas [zanirepa] —> [snirepa] or 

[zirepa] 

je m’en vais [Xomave] —► [smave] 

je viens [zavjc] —>[s;vje] —* [Sfj? ] 
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The same factors operate in English. Following are some examples 
of “sloppy” (i.e., rapid “colloquial” speech) English: 


Formal 
Let us go eat. 

Did you see him? 
Come here! 

Stop complaining! 
John is here! 


Colloquial 
Let’s go eat. 

Did’ya see ’im? 
C’m’ere! 

Quit your bitching! 


Phonetic renditions 
lfzgawiyt —> lesgwiyt 
dijasiyim —► jusiym 
kmiyr 
kwic 0 -bicn 

janzyiyr^ jansyiyr^fanSiyr 


The last example can be interpreted as: John’s here; John’s year; 
John’s ear; John’s cheer; John’s shear; The john is here; John’s sear; 
John’s seer; etc. 


Speed phonemics research would have application in voice tran¬ 
scription, voice transcription aids, voice transcription training, speech 
ciphony and other forms of artificial speech or speech-by-rule, and in 
the electronic processing of recorded speech matter (voice-printing, 
language identification, machine transcription). When the stream of 
speech rather than the syllable patterns within words is examined, 
changes occur in the permitted clustering of phonemes (changes of 
composition and changes of frequency). For example, the initial 
cluster tm- is rare in English words (tmesis), as are tl- (Tlingit), 
gw- (Gwen), kn- (knockwurst) and ts- (tsetse), but are quite common 
across word boundaries in running speech (t’morrow; out’late; “g’wan 
now”; C’n I go?; ’t’s all over now). Indeed there are even some new 
s- initiated 3-phoneme clusters (stl- as in “it is too late” rendered as 
s’t’late; for some people a glottal stop can precede the s-, this forms a 
4 part initial cluster). Similar changes in elision, compression, and 
quality occur with the vowel phonemes; compare the similar feature 
in English and Russian whereby unaccented vowel phonemes are re¬ 
duced to schwa. 

Should I stress what is now the obvious—that the “system” of 
conventions covering hyphenation of spelled English words has little 
correlation to syllable boundaries in speech. The hyphenation con¬ 
vention (with its origins in the field of typesetting and subsequent 
development by editors) is roughly based on morpheme boundaries 
within words along with a series of ad hoc “rules” applied non- 
consistently. Compare a few words hyphenated according to the 
“rules” as shown in a dictionary and according to real speech: 

know-ing [no-(w)ig]; knowl-edge [na-lijj; ac-knowl-edge - 
[ae k-na-lijf or a-kna-lij]; ac-knowl-edg(e)-ment [ae(kp(k)na- 
li-jmint]; lan-guid-ly [leq-gwi-dli or lae-wi-dli]; lank-i-ly 
[lae ij-ki-li or lae i}-kli] 
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The above comparison leads to an explanation of the fact that in 
reading it is sometimes necessary to back up and re-scan a hyphenated 
word—the incorrect phonemic signal was projected by the spelling of 
the portion of the word preceding the hyphen, as in the following 
examples: 

Have you seen my lan- Then he picked up the mus- 

S guage book? readme grape, 

tern? jelymeat. 

ky brother? (ing mouse. 

Let us now examine a typical sentence fragment full of short words 
when all possible word and syllable divisions are marked (despite the 
ad hoc rule about not hyphenating certain short words): 

“It is easy to see why the analyst in using language . . 
it-is-ea-sy-to-see-why-the-a-na-lyst-in-us-ing-lan-guage . .. 

The same speech utterance at 4 progressively faster speeds can be 
represented by a combination of “eye dialect” and phonemes as follows: 

its^ea-sy-to-see-wai-tha-ae-na-lyst-in-yu-zing-laeij-gwidge . . . 
i-tsiy-zi-ta -siy-wai-da-aen-na-lis-tin-yu-zin-lae-wff . . . 
i-tsiy-zi-tsiy-wai-cfaen-na-lis-tin-nyu-zan-lae-wij . . . 
tsiy-zi-tsiy-wai-daen-li-stin-nyu-zn-lae-wic . . . 

There are, of course, many possible variations reflecting regional, social 
and ideosyncratic dialects; thus the phoneme patterns of distribution 
in a given language express this variation in generalized allophonic 
distributional sets per investigated dialect. When the factor of ut¬ 
terance speed is included, it is obvious that “speed phonemics” will 
have to take the phonological syllable as the basic unit of investigation. 
And any type of application of electronics or “machines” to the per¬ 
ception, transcription or translation of oral communication data will 
also have to be based on the acoustic signals—and not on written 
words. 

FUTURE LINGUISTIC RESEARCH 

An analysis as presented in this paper based on language rather than 
on a secondary derivative cultural phenomenon (writing) has implica¬ 
tions for the future. Enormous sums of money have been spent in the 
past 15 years on mechanical translation. In the early days most of 
the money disappeared in attempts to mechanize lexical, morphological 
and phrasal concordance equivalents in some two given languages. 
Much of this work was not directed by competent linguists (mainly 
because there were very few such linguists who could also cope with 
the new realm of cybernetics and computers) in the initial years. The 
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current theoretical work and trial applications in computational lin¬ 
guistics and theoretical machine applications for language work deal 
mainly with the syntactic deep structure of languages. Theoreticians 
are now investigating theories of the semantic component of language. 
Most of this involves the devising of models and theoretical constructs 
which simulate, parallel, or can be said to stand in some sense for some 
one portion of the “black box” processes which take place in the human 
brain as the individual reacts and performs psycholinguistically in a 
sociolinguistic situation in the physical environment in which he lives. 

Linguistics has been at the “lever and pulley” stage for the past 
century, is currently making rapid strides (thanks partly to computer 
technology and its challenges), and should be able to make enormous 
advances when and if electronic devices are designed to handle lan¬ 
guage tasks (rather than pushing and squeezing language into devices 
designed to perform counting tasks based on binary decision pro¬ 
cedures). 

The input and output of the devices of the present and of the future 
will be in the components of surface structure (syllables and phonemes 
“interpreted” as morphemes; derivative letters and written words re¬ 
flecting or representing certain aspects of the phonemes and mor¬ 
phemes). In the fairly near future the electronic input from optical 
scanning of texts in language X will be manipulated in some series 
of programming procedures and be “printed” in language Y (i.e., one 
step more advanced than the current primitive machine translation 
devices which by that time should be a bit more sophisticated). The 
next step will be to use the primary units of speech (syllable and 
phoneme) and attack the problems of transcription of speech. Step 
three will marry the latter technology to the then existing translation 
technology. The first solutions will probably convert recordings of 
sound waves of language X into “written” forms of language Y which 
can be “read” or deciphered by human beings as well as fed into 
various types of information storage units for information retrieval 
as required. Later and still more sophisticated solutions will provide 
devices for sequential and simultaneous interpretation in two or more 
languages. 
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